Configurable memory circuit system and method

ABSTRACT

A memory circuit system and method are provided in the context of various embodiments. In one embodiment, an interface circuit remains in communication with a plurality of memory circuits and a system. The interface circuit is operable to interface the memory circuits and the system for performing various functionality (e.g. power management, simulation/emulation, etc.).

CROSS-REFERENCE TO RELATED APPLICATIONS

The present application is a continuation of, and claims priority to, U.S. application Ser. No. 14/090,342, for “CONFIGURABLE MEMORY CIRCUIT SYSTEM AND METHOD,” to Suresh Rajan, Keith R. Schakel, Michael John Sebastian Smith, David T. Wang, and Frederick Daniel Weber, filed on Nov. 26, 2013, now U.S. Pat. No. 9,171,585, which is incorporated herein by reference in its entirety.

U.S. application Ser. No. 14/090,342 is a continuation-in-part of U.S. application Ser. No. 13/367,182, filed Feb. 6, 2012, now U.S. Pat. No. 8,868,829, which is a continuation of U.S. application Ser. No. 11/929,636 filed Oct. 30, 2007, now U.S. Pat. No. 8,244,971, which is a continuation of PCT application serial no. PCT/US2007/016385 filed Jul. 18, 2007, which is a continuation-in-part of each of U.S. application Ser. No. 11/461,439, filed Jul. 31, 2006, now U.S. Pat. No. 7,580,312, U.S. application Ser. No. 11/524,811, filed Sep. 20, 2006, now U.S. Pat. No. 7,590,796, U.S. application Ser. No. 11/524,730, filed Sep. 20, 2006, now U.S. Pat. No. 7,472,220, U.S. application Ser. No. 11/524,812 filed Sep. 20, 2006, now U.S. Pat. No. 7,386,656, U.S. application Ser. No. 11/524,716, filed Sep. 20, 2006, now U.S. Pat. No. 7,392,338, U.S. application Ser. No. 11/538,041, filed Oct. 2, 2006, now abandoned, U.S. application Ser. No. 11/584,179, filed Oct. 20, 2006, now U.S. Pat. No. 7,581,127, U.S. application Ser. No. 11/762,010, filed Jun. 12, 2007, now U.S. Pat. No. 8,041,881, and U.S. application Ser. No. 11/762,013, filed Jun. 12, 2007, now U.S. Pat. No. 8,090,897, each of which is incorporated herein by reference.

U.S. application Ser. No. 14/090,342 is also a continuation-in-part of U.S. application Ser. No. 12/507,682 filed on Jul. 22, 2009, now U.S. Pat. No. 8,949,519, which is a continuation of U.S. application Ser. No. 11/461,427, filed Jul. 31, 2006, now U.S. Pat. No. 7,609,567, which is a continuation-in-part of U.S. application Ser. No. 11/474,075 filed Jun. 23, 2006 now U.S. Pat. No. 7,515,453 which claims benefit of U.S. provisional application 60/693,631 filed Jun. 24, 2005, each of which is incorporated herein by reference.

U.S. application Ser. No. 14/090,342 is also a continuation-in-part of U.S. application Ser. No. 11/672,921 filed on Feb. 8, 2007, which claims the benefit of U.S. provisional application 60/722,414, filed Feb. 9, 2006 and U.S. provisional application 60/865,624 filed Nov. 13, 2006 and which is a continuation-in-part of each of: U.S. application Ser. No. 11/461,437 filed Jul. 31, 2006 now U.S. Pat. No. 8,077,535; U.S. application Ser. No. 11/702,981 filed Feb. 5, 2007 now U.S. Pat. No. 8,089,795; and U.S. application Ser. No. 11/702,960 filed Feb. 5, 2007, each of which is incorporated herein by reference.

U.S. application Ser. No. 14/090,342 is also a continuation-in-part of U.S. application Ser. No. 13/620,425, filed on Sep. 14, 2012, now U.S. Pat. No. 8,797,779, which is a continuation of U.S. application Ser. No. 13/341,844, filed on Dec. 30, 2011, now U.S. Pat. No. 8,566,556, which is a divisional of U.S. application Ser. No. 11/702,981, filed on Feb. 5, 2007 now U.S. Pat. No. 8,089,795, which claims the benefit of U.S. provisional application 60/865,624, filed Nov. 13, 2006, and claims the benefit of U.S. provisional application 60/772,414, filed on Feb. 9, 2006. U.S. application Ser. No. 11/702,981 is also a continuation-in-part of U.S. application Ser. No. 11/461,437, filed Jul. 31, 2006 now U.S. Pat. No. 8,077,535, each of which is incorporated herein by reference.

U.S. application Ser. No. 14/090,342 is also a continuation-in-part of U.S. application Ser. No. 13/615,008, now U.S. Pat. No. 8,631,220, filed on Sep. 13, 2012, which is a continuation application of U.S. application Ser. No. 11/939,440, filed Nov. 13, 2007, now U.S. Pat. No. 8,327,104, which is continuation-in-part of U.S. application Ser. No. 11/524,811, filed Sep. 20, 2006, now U.S. Pat. No. 7,590,796, which is a continuation-in-part of U.S. application Ser. No. 11/461,439, filed Jul. 31, 2006, now U.S. Pat. No. 7,580,312. U.S. application Ser. No. 11/939,440, also claims the benefit of priority to U.S. provisional application 60/865,627, filed Nov. 13, 2006, each of which is incorporated herein by reference.

U.S. application Ser. No. 14/090,342 is also a continuation-in-part of U.S. application Ser. No. 13/618,246, now U.S. Pat. No. 8,615,679, filed on Sep. 13, 2012, which is a continuation of U.S. patent application Ser. No. 13/280,251, filed Oct. 24, 2011, now U.S. Pat. No. 8,386,833, which is continuation of U.S. patent application Ser. No. 11/763,365, filed Jun. 14, 2007, now U.S. Pat. No. 8,060,774, which is a continuation-in part of U.S. patent application Ser. No. 11/474,076, filed on Jun. 23, 2006, which claims the benefit of U.S. provisional patent application 60/693,631, filed on Jun. 24, 2005. U.S. patent application Ser. No. 11/763,365 is also a continuation-in-part of U.S. patent application Ser. No. 11/515,223, filed on Sep. 1, 2006, now U.S. Pat. No. 8,619,452, which claims the benefit of U.S. provisional patent application 60/713,815, filed on Sep. 2, 2005. U.S. patent application Ser. No. 11/763,365 also claimed the benefit of U.S. provisional patent application 60/814,234, filed on Jun. 16, 2006, each of which is incorporated herein by reference.

U.S. application Ser. No. 14/090,342 is also a continuation-in-part of U.S. application Ser. No. 13/620,565, filed on Sep. 14, 2012, now U.S. Pat. No. 8,811,065, which is a continuation of U.S. application Ser. No. 11/515,223, filed on Sep. 1, 2006, now U.S. Pat. No. 8,619,452, which claims the benefit of U.S. provisional patent application 60/713,815, filed Sep. 2, 2005, each of which is incorporated herein by reference.

U.S. application Ser. No. 14/090,342 is also a continuation-in-part of U.S. application Ser. No. 13/620,645, filed on Sep. 14, 2012, now abandoned, which is a continuation of U.S. application Ser. No. 11/929,655, filed on Oct. 30, 2007, now abandoned, which is a continuation of U.S. application Ser. No. 11/828,181, filed on Jul. 25, 2007, now abandoned, which claims the benefit of U.S. provisional application 60/823,229, filed Aug. 22, 2006, and which is a continuation-in-part of U.S. application Ser. No. 11/584,179, filed on Oct. 20, 2006, now U.S. Pat. No. 7,581,127, which is a continuation of U.S. application Ser. No. 11/524,811, filed on Sep. 20, 2006, now U.S. Pat. No. 7,590,796, and is a continuation-in-part of U.S. application Ser. No. 11/461,439, filed on Jul. 31, 2006, now U.S. Pat. No. 7,580,312, each of which is incorporated herein by reference.

U.S. application Ser. No. 14/090,342 is also a continuation-in-part of U.S. application Ser. No. 13/473,827, filed May 17, 2012, now U.S. Pat. No. 8,631,193, which is a divisional of U.S. application Ser. No. 12/378,328, filed Feb. 14, 2009, now U.S. Pat. No. 8,438,328, which claims the benefit of U.S. provisional application 61/030,534, filed on Feb. 21, 2008, each of which is incorporated herein by reference.

U.S. application Ser. No. 14/090,342 is also a continuation-in-part of U.S. application Ser. No. 13/620,793, filed on Sep. 15, 2012, now U.S. Pat. No. 8,977,806, which is a continuation of U.S. application Ser. No. 12/057,306, filed Mar. 27, 2008, now U.S. Pat. No. 8,397,013, which is a continuation-in-part of U.S. application Ser. No. 11/611,374, filed on Dec. 15, 2006, now U.S. Pat. No. 8,055,833, which claims the benefit of U.S. provisional application 60/849,631, filed Oct. 5, 2006, each of which is incorporated herein by reference.

U.S. application Ser. No. 14/090,342 is also a continuation-in-part of U.S. application Ser. No. 13/620,424, filed on Sep. 14, 2012, now U.S. Pat. No. 8,751,732, which is a continuation of U.S. application Ser. No. 13/276,212, filed Oct. 18, 2011, now U.S. Pat. No. 8,370,566, which is a continuation of U.S. application Ser. No. 11/611,374, filed Dec. 15, 2006, now U.S. Pat. No. 8,055,833, which claims the benefit of U.S. provisional application 60/849,631, filed Oct. 5, 2006, each of which is incorporated herein by reference.

U.S. application Ser. No. 14/090,342 is also a continuation-in-part of U.S. application Ser. No. 13/597,895, filed Aug. 29, 2012, now U.S. Pat. No. 8,675,429, which is a continuation of U.S. application Ser. No. 13/367,259, filed Feb. 6, 2012, now U.S. Pat. No. 8,279,690, which is a divisional of U.S. application Ser. No. 11/941,589, filed Nov. 16, 2007, now U.S. Pat. No. 8,111,566, each of which is incorporated herein by reference.

U.S. application Ser. No. 14/090,342 is also a continuation-in-part of U.S. application Ser. No. 13/455,691, filed Apr. 25, 2012, now U.S. Pat. No. 8,710,862, which is a continuation of U.S. patent application Ser. No. 12/797,557 filed Jun. 9, 2010, now U.S. Pat. No. 8,169,233, which claims the benefit of U.S. provisional application 61/185,585, filed on Jun. 9, 2009, each of which is incorporated herein by reference.

U.S. application Ser. No. 14/090,342 is also a continuation-in-part of U.S. application Ser. No. 13/620,412, filed Sep. 14, 2012, now U.S. Pat. No. 8,705,240, which is a continuation of U.S. patent application Ser. No. 13/279,068, filed Oct. 21, 2011, now U.S. Pat. No. 8,730,670, which is a divisional of U.S. patent application Ser. No. 12/203,100, filed Sep. 2, 2008, now U.S. Pat. No. 8,081,474, which claims the benefit of U.S. provisional application 61/014,740, filed Dec. 18, 2007, each of which is incorporated herein by reference.

U.S. application Ser. No. 14/090,342 is also a continuation-in-part of U.S. application Ser. No. 13/898,002, filed May 20, 2013, now U.S. Pat. No. 8,760,936, which is a continuation of U.S. application Ser. No. 13/411,489, filed Mar. 2, 2012, now U.S. Pat. No. 8,446,781, which is a continuation of U.S. application Ser. No. 11/939,432, filed Nov. 13, 2007, now U.S. Pat. No. 8,130,560, which claims the benefit of U.S. provisional application 60/865,623, filed Nov. 13, 2006, each of which is incorporated herein by reference.

U.S. application Ser. No. 14/090,342 is also a continuation-in-part of U.S. application Ser. No. 11/515,167, filed Sep. 1, 2006, now U.S. Pat. No. 8,796,830, each of which is incorporated herein by reference.

U.S. application Ser. No. 14/090,342 is also a continuation-in-part of U.S. application Ser. No. 13/620,199, filed Sep. 14, 2012, now U.S. Pat. No. 8,762,675, which is a continuation of U.S. application Ser. No. 12/144,396, filed Jun. 23, 2008, now U.S. Pat. No. 8,386,722, each of which is incorporated herein by reference.

U.S. application Ser. No. 14/090,342 is also a continuation-in-part of U.S. application Ser. No. 13/620,207, filed Sep. 14, 2012, now U.S. Pat. No. 8,819,356, which is a continuation of U.S. application Ser. No. 12/508,496, filed Jul. 23, 2009, now U.S. Pat. No. 8,335,894, which claims the benefit of U.S. provisional application 61/083,878, filed Jul. 25, 2008, each of which is incorporated herein by reference.

BACKGROUND AND FIELD OF THE INVENTION

This invention relates generally to memory.

SUMMARY

In one embodiment, a memory subsystem is provided including an interface circuit adapted for coupling with a plurality of memory circuits and a system. The interface circuit is operable to interface the memory circuits and the system for emulating at least one memory circuit with at least one aspect that is different from at least one aspect of at least one of the plurality of memory circuits. Such aspect includes a signal, a capacity, a timing, and/or a logical interface.

In another embodiment, a memory subsystem is provided including an interface circuit adapted for communication with a system and a majority of address or control signals of a first number of memory circuits. The interface circuit includes emulation logic for emulating at least one memory circuit of a second number.

In yet another embodiment, a memory circuit power management system and method are provided. In use, an interface circuit is in communication with a plurality of physical memory circuits and a system. The interface circuit is operable to interface the physical memory circuits and the system for simulating at least one virtual memory circuit with a first power behavior that is different from a second power behavior of the physical memory circuits.

In still yet another embodiment, a memory circuit power management system and method are provided. In use, an interface circuit is in communication with a plurality of memory circuits and a system. The interface circuit is operable to interface the memory circuits and the system for performing a power management operation in association with at least a portion of the memory circuits. Such power management operation is performed during a latency associated with one or more commands directed to at least a portion of the memory circuits.

In even another embodiment, an apparatus and method are provided for communicating with a plurality of physical memory circuits. In use, at least one virtual memory circuit is simulated where at least one aspect (e.g. power-related aspect, etc.) of such virtual memory circuit(s) is different from at least one aspect of at least one of the physical memory circuits. Further, in various embodiments, such simulation may be carried out by a system (or component thereof), an interface circuit, etc.

In another embodiment, an power saving system and method are provided. In use, at least one of a plurality of memory circuits is identified that is not currently being accessed. In response to the identification of the at least one memory circuit, a power saving operation is initiated in association with the at least one memory circuit.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 shows a system coupled to multiple memory circuits and an interface circuit according to one embodiment of this invention.

FIG. 2 shows a buffered stack of DRAM circuits each having a dedicated data path from the buffer chip and sharing a single address, control, and clock bus.

FIG. 3 shows a buffered stack of DRAM circuits having two address, control, and clock busses and two data busses.

FIG. 4 shows a buffered stack of DRAM circuits having one address, control, and clock bus and two data busses.

FIG. 5 shows a buffered stack of DRAM circuits having one address, control, and clock bus and one data bus.

FIG. 6 shows a buffered stack of DRAM circuits in which the buffer chip is located in the middle of the stack of DRAM chips.

FIG. 7 is a flow chart showing one method of storing information.

FIG. 8 shows a high capacity DIMM using buffered stacks of DRAM chips according to one embodiment of this invention.

FIG. 9 is a timing diagram showing one embodiment of how the buffer chip makes a buffered stack of DRAM circuits appear to the system or memory controller to use longer column address strobe (CAS) latency DRAM chips than is actually used by the physical DRAM chips.

FIG. 10 shows a timing diagram showing the write data timing expected by DRAM in a buffered stack, in accordance with another embodiment of this invention.

FIG. 11 is a timing diagram showing how write control signals are delayed by a buffer chip in accordance with another embodiment of this invention.

FIG. 12 is a timing diagram showing early write data from a memory controller or an advanced memory buffer (AMB) according to yet another embodiment of this invention.

FIG. 13 is a timing diagram showing address bus conflicts caused by delayed write operations.

FIG. 14 is a timing diagram showing variable delay of an activate operation through a buffer chip.

FIG. 15 is a timing diagram showing variable delay of a precharge operation through a buffer chip.

FIG. 16 shows a buffered stack of DRAM circuits and the buffer chip which presents them to the system as if they were a single, larger DRAM circuit, in accordance with one embodiment of this invention.

FIG. 17 is a flow chart showing a method of refreshing a plurality of memory circuits, in accordance with one embodiment of this invention.

FIG. 18 shows a block diagram of another embodiment of the invention.

FIG. 19 illustrates a multiple memory circuit framework, in accordance with one embodiment.

FIGS. 20A-E show a stack of dynamic random access memory (DRAM) circuits that utilize one or more interface circuits, in accordance with various embodiments.

FIGS. 21A-D show a memory module which uses dynamic random access memory (DRAM) circuits with various interface circuits, in accordance with different embodiments.

FIGS. 22A-E show a memory module which uses DRAM circuits with an advanced memory buffer (AMB) chip and various other interface circuits, in accordance with various embodiments.

FIG. 23 shows a system in which four 512 Mb DRAM circuits are mapped to a single 2 Gb DRAM circuit, in accordance with yet another embodiment.

FIG. 24 shows a memory system comprising FB-DIMM modules using DRAM circuits with AMB chips, in accordance with another embodiment.

The present application is a continuation of U.S. application Ser. No. 14/090,342, filed Nov. 26, 2013, which is incorporated herein by reference in its entirety.

DETAILED DESCRIPTION

Various embodiments are set forth below. It should be noted that the claims corresponding to each of such embodiments should be construed in terms of the relevant description set forth herein. If any definitions, etc. set forth herein are contradictory with respect to terminology of certain claims, such terminology should be construed in terms of the relevant description.

The present application is a continuation of U.S. application Ser. No. 14/090,342, filed Nov. 26, 2013, which is incorporated herein by reference in its entirety.

FIG. 1 illustrates a system 100 including a system device 106 coupled to an interface circuit 102, which is in turn coupled to a plurality of physical memory circuits 104A-N. The physical memory circuits may be any type of memory circuits. In some embodiments, each physical memory circuit is a separate memory chip. For example, each may be a DDR2 DRAM. In some embodiments, the memory circuits may be symmetrical, meaning each has the same capacity, type, speed, etc., while in other embodiments they may be asymmetrical. For ease of illustration only, three such memory circuits are shown, but actual embodiments may use any plural number of memory circuits. As will be discussed below, the memory chips may optionally be coupled to a memory module (not shown), such as a DIMM.

The system device may be any type of system capable of requesting and/or initiating a process that results in an access of the memory circuits. The system may include a memory controller (not shown) through which it accesses the memory circuits.

The interface circuit may include any circuit or logic capable of directly or indirectly communicating with the memory circuits, such as a buffer chip, advanced memory buffer (AMB) chip, etc. The interface circuit interfaces a plurality of signals 108 between the system device and the memory circuits. Such signals may include, for example, data signals, address signals, control signals, clock signals, and so forth. In some embodiments, all of the signals communicated between the system device and the memory circuits are communicated via the interface circuit. In other embodiments, some other signals 110 are communicated directly between the system device (or some component thereof, such as a memory controller, an AMB, or a register) and the memory circuits, without passing through the interface circuit. In some such embodiments, the majority of signals are communicated via the interface circuit, such that L>M.

As will be explained in greater detail below, the interface circuit presents to the system device an interface to emulated memory devices which differ in some aspect from the physical memory circuits which are actually present. For example, the interface circuit may tell the system device that the number of emulated memory circuits is different than the actual number of physical memory circuits. The terms “emulating”, “emulated”, “emulation”, and the like will be used in this disclosure to signify emulation, simulation, disguising, transforming, converting, and the like, which results in at least one characteristic of the memory circuits appearing to the system device to be different than the actual, physical characteristic. In some embodiments, the emulated characteristic may be electrical in nature, physical in nature, logical in nature (e.g. a logical interface, etc.), pertaining to a protocol, etc. An example of an emulated electrical characteristic might be a signal, or a voltage level. An example of an emulated physical characteristic might be a number of pins or wires, a number of signals, or a memory capacity. An example of an emulated protocol characteristic might be a timing, or a specific protocol such as DDR3.

In the case of an emulated signal, such signal may be a control signal such as an address signal, a data signal, or a control signal associated with an activate operation, precharge operation, write operation, mode register read operation, refresh operation, etc. The interface circuit may emulate the number of signals, type of signals, duration of signal assertion, and so forth. It may combine multiple signals to emulate another signal.

The interface circuit may present to the system device an emulated interface to e.g. DDR3 memory, while the physical memory chips are, in fact, DDR2 memory. The interface circuit may emulate an interface to one version of a protocol such as DDR2 with 5-5-5 latency timing, while the physical memory chips are built to another version of the protocol such as DDR2 with 3-3-3 latency timing. The interface circuit may emulate an interface to a memory having a first capacity that is different than the actual combined capacity of the physical memory chips.

An emulated timing may relate to latency of e.g. a column address strobe (CAS) latency, a row address to column address latency (tRCD), a row precharge latency (tRP), an activate to precharge latency (tRAS), and so forth. CAS latency is related to the timing of accessing a column of data. tRCD is the latency required between the row address strobe (RAS) and CAS. tRP is the latency required to terminate an open row and open access to the next row. tRAS is the latency required to access a certain row of data between an activate operation and a precharge operation.

The interface circuit may be operable to receive a signal from the system device and communicate the signal to one or more of the memory circuits after a delay (which may be hidden from the system device). Such delay may be fixed, or in some embodiments it may be variable. If variable, the delay may depend on e.g. a function of the current signal or a previous signal, a combination of signals, or the like. The delay may include a cumulative delay associated with any one or more of the signals. The delay may result in a time shift of the signal forward or backward in time with respect to other signals. Different delays may be applied to different signals. The interface circuit may similarly be operable to receive a signal from a memory circuit and communicate the signal to the system device after a delay.

The interface circuit may take the form of, or incorporate, or be incorporated into, a register, an AMB, a buffer, or the like, and may comply with Joint Electron Device Engineering Council (JEDEC) standards, and may have forwarding, storing, and/or buffering capabilities.

In some embodiments, the interface circuit may perform operations without the system device's knowledge. One particularly useful such operation is a power-saving operation. The interface circuit may identify one or more of the memory circuits which are not currently being accessed by the system device, and perform the power saving operation on those. In one such embodiment, the identification may involve determining whether any page (or other portion) of memory is being accessed. The power saving operation may be a power down operation, such as a precharge power down operation.

The interface circuit may include one or more devices which together perform the emulation and related operations. The interface circuit may be coupled or packaged with the memory devices, or with the system device or a component thereof, or separately. In one embodiment, the memory circuits and the interface circuit are coupled to a DIMM.

FIG. 2 illustrates one embodiment of a system 200 including a system device (e.g. host system 204, etc.) which communicates address, control, clock, and data signals with a memory subsystem 201 via an interface.

The memory subsystem includes a buffer chip 202 which presents the host system with emulated interface to emulated memory, and a plurality of physical memory circuits which, in the example shown, are DRAM chips 206A-D. In one embodiment, the DRAM chips are stacked, and the buffer chip is placed electrically between them and the host system. Although the embodiments described here show the stack consisting of multiple DRAM circuits, a stack may refer to any collection of memory circuits (e.g. DRAM circuits, flash memory circuits, or combinations of memory circuit technologies, etc.).

The buffer chip buffers communicates signals between the host system and the DRAM chips, and presents to the host system an emulated interface to present the memory as though it were a smaller number of larger capacity DRAM chips, although in actuality there is a larger number of smaller capacity DRAM chips in the memory subsystem. For example, there may be eight 512 Mb physical DRAM chips, but the buffer chip buffers and emulates them to appear as a single 4 Gb DRAM chip, or as two 2 Gb DRAM chips. Although the drawing shows four DRAM chips, this is for ease of illustration only; the invention is, of course, not limited to using four DRAM chips.

In the example shown, the buffer chip is coupled to send address, control, and clock signals 208 to the DRAM chips via a single, shared address, control, and clock bus, but each DRAM chip has its own, dedicated data path for sending and receiving data signals 210 to/from the buffer chip.

Throughout this disclosure, the reference number 1 will be used to denote the interface between the host system and the buffer chip, the reference number 2 will be used to denote the address, control, and clock interface between the buffer chip and the physical memory circuits, and the reference number 3 will be used to denote the data interface between the buffer chip and the physical memory circuits, regardless of the specifics of how any of those interfaces is implemented in the various embodiments and configurations described below. In the configuration shown in FIG. 2, there is a single address, control, and clock interface channel 2 and four data interface channels 3; this implementation may thus be said to have a “1A4D” configuration (wherein “1 A” means one address, control, and clock channel in interface 2, and “4D” means four data channels in interface 3).

In the example shown, the DRAM chips are physically arranged on a single side of the buffer chip. The buffer chip may, optionally, be a part of the stack of DRAM chips, and may optionally be the bottommost chip in the stack. Or, it may be separate from the stack.

FIG. 3 illustrates another embodiment of a system 301 in which the buffer chip 303 is interfaced to a host system 304 and is coupled to the DRAM chips 307A-307D somewhat differently than in the system of FIG. 2. There are a plurality of shared address, control, and clock busses 309A and 309B, and a plurality of shared data busses 305A and 305B. Each shared bus has two or more DRAM chips coupled to it. As shown, the sharing need not necessarily be the same in the data busses as it is in the address, control, and clock busses. This embodiment has a “2A2D” configuration.

FIG. 4 illustrates another embodiment of a system 411 in which the buffer chip 413 is interfaced to a host system 404 and is coupled to the DRAM chips 417A-417D somewhat differently than in the system of FIG. 2 or 3. There is a shared address, control, and clock bus 419, and a plurality of shared data busses 415A and 415B. Each shared bus has two or more DRAM chips coupled to it. This implementation has a “1A2D” configuration.

FIG. 5 illustrates another embodiment of a system 521 in which the buffer chip 523 is interfaced to a host system 504 and is coupled to the DRAM chips 527A-527D somewhat differently than in the system of FIGS. 2 through 4. There is a shared address, control, and clock bus 529, and a shared data bus 525. This implementation has a “1A1D” configuration.

FIG. 6 illustrates another embodiment of a system 631 in which the buffer chip 633 is interfaced to a host system 604 and is coupled to the DRAM chips 637A-637D somewhat differently than in the system of FIGS. 2 through 5. There is a plurality of shared address, control, and clock busses 639A and 639B, and a plurality of dedicated data paths 635. Each shared bus has two or more DRAM chips coupled to it. Further, in the example shown, the DRAM chips are physically arranged on both sides of the buffer chip. There may be, for example, sixteen DRAM chips, with the eight DRAM chips on each side of the buffer chip arranged in two stacks of four chips each. This implementation has a “2A4D” configuration.

FIGS. 2 through 6 are not intended to be an exhaustive listing of all possible permutations of data paths, busses, and buffer chip configurations, and are only illustrative of some ways in which the host system device can be in electrical contact only with the load of the buffer chip and thereby be isolated from whatever physical memory circuits, data paths, busses, etc. exist on the (logical) other side of the buffer chip.

FIG. 7 illustrates one embodiment of a method 700 for storing at least a portion of information received in association with a first operation, for use in performing a second operation. Such a method may be practiced in a variety of systems, such as, but not limited to, those of FIGS. 1-6. For example, the method may be performed by the interface circuit of FIG. 1 or the buffer chip of FIG. 2.

Initially, first information is received (702) in association with a first operation to be performed on at least one of the memory circuits (DRAM chips). Depending on the particular implementation, the first information may be received prior to, simultaneously with, or subsequent to the instigation of the first operation. The first operation may be, for example, a row operation, in which case the first information may include e.g. address values received by the buffer chip via the address bus from the host system. At least a portion of the first information is then stored (704).

The buffer chip also receives (706) second information associated with a second operation. For convenience, this receipt is shown as being after the storing of the first information, but it could also happen prior to or simultaneously with the storing. The second operation may be, for example, a column operation.

Then, the buffer chip performs (708) the second operation, utilizing the stored portion of the first information, and the second information.

If the buffer chip is emulating a memory device which has a larger capacity than each of the physical DRAM chips in the stack, the buffer chip may receive from the host system's memory controller more address bits than are required to address any given one of the DRAM chips. In this instance, the extra address bits may be decoded by the buffer chip to individually select the DRAM chips, utilizing separate chip select signals (not shown) to each of the DRAM chips in the stack.

For example, a stack of four x4 1 Gb DRAM chips behind the buffer chip may appear to the host system as a single x4 4 Gb DRAM circuit, in which case the memory controller may provide sixteen row address bits and three bank address bits during a row operation (e.g. an activate operation), and provide eleven column address bits and three bank address bits during a column operation (e.g. a read or write operation). However, the individual DRAM chips in the stack may require only fourteen row address bits and three bank address bits for a row operation, and eleven column address bits and three bank address bits during a column operation. As a result, during a row operation (the first operation in the method 702), the buffer chip may receive two address bits more than are needed by any of the DRAM chips. The buffer chip stores (704) these two extra bits during the row operation (in addition to using them to select the correct one of the DRAM chips), then uses them later, during the column operation, to select the correct one of the DRAM chips.

The mapping between a system address (from the host system to the buffer chip) and a device address (from the buffer chip to a DRAM chip) may be performed in various manners. In one embodiment, lower order system row address and bank address bits may be mapped directly to the device row address and bank address bits, with the most significant system row address bits (and, optionally, the most significant bank address bits) being stored for use in the subsequent column operation. In one such embodiment, what is stored is the decoded version of those bits; in other words, the extra bits may be stored either prior to or after decoding. The stored bits may be stored, for example, in an internal lookup table (not shown) in the buffer chip, for one or more clock cycles.

As another example, the buffer chip may have four 512 Mb DRAM chips with which it emulates a single 2 Gb DRAM chip. The system will present fifteen row address bits, from which the buffer chip may use the fourteen low order bits (or, optionally, some other set of fourteen bits) to directly address the DRAM chips. The system will present three bank address bits, from which the buffer chip may use the two low order bits (or, optionally, some other set of two bits) to directly address the DRAM chips. During a row operation, the most significant bank address bit (or other unused bit) and the most significant row address bit (or other unused bit) are used to generate the four DRAM chip select signals, and are stored for later reuse. And during a subsequent column operation, the stored bits are again used to generate the four DRAM chip select signals. Optionally, the unused bank address is not stored during the row operation, as it will be re-presented during the subsequent column operation.

As yet another example, addresses may be mapped between four 1 Gb DRAM circuits to emulate a single 4 Gb DRAM circuit. Sixteen row address bits and three bank address bits come from the host system, of which the low order fourteen address bits and all three bank address bits are mapped directly to the DRAM circuits. During a row operation, the two most significant row address bits are decoded to generate four chip select signals, and are stored using the bank address bits as the index. During the subsequent column operation, the stored row address bits are again used to generate the four chip select signals.

A particular mapping technique may be chosen, to ensure that there are no unnecessary combinational logic circuits in the critical timing path between the address input pins and address output pins of the buffer chip. Corresponding combinational logic circuits may instead be used to generate the individual chip select signals. This may allow the capacitive loading on the address outputs of the buffer chip to be much higher than the loading on the individual chip select signal outputs of the buffer chip.

In another embodiment, the address mapping may be performed by the buffer chip using some of the bank address signals from the host system to generate the chip select signals. The buffer chip may store the higher order row address bits during a row operation, using the bank address as the index, and then use the stored address bits as part of the DRAM circuit bank address during a column operation.

For example, four 512 Mb DRAM chips may be used in emulating a single 2 Gb DRAM. Fifteen row address bits come from the host system, of which the low order fourteen are mapped directly to the DRAM chips. Three bank address bits come from the host system, of which the least significant bit is used as a DRAM circuit bank address bit for the DRAM chips. The most significant row address bit may be used as an additional DRAM circuit bank address bit. During a row operation, the two most significant bank address bits are decoded to generate the four chip select signals. The most significant row address bit may be stored during the row operation, and reused during the column operation with the least significant bank address bit, to form the DRAM circuit bank address.

The column address from the host system memory controller may be mapped directly as the column address to the DRAM chips in the stack, since each of the DRAM chips may have the same page size, regardless any differences in the capacities of the (asymmetrical) DRAM chips.

Optionally, address bit A[10] may be used by the memory controller to enable or disable auto-precharge during a column operation, in which case the buffer chip may forward that bit to the DRAM circuits without any modification during a column operation.

In various embodiments, it may be desirable to determine whether the simulated DRAM circuit behaves according to a desired DRAM standard or other design specification. Behavior of many DRAM circuits is specified by the JEDEC standards, and it may be desirable to exactly emulate a particular JEDEC standard DRAM. The JEDEC standard defines control signals that a DRAM circuit must accept and the behavior of the DRAM circuit as a result of such control signals. For example, the JEDEC specification for DDR2 DRAM is known as JESD79-2B. If it is desired to determine whether a standard is met, the following algorithm may be used. Using a set of software verification tools, it checks for formal verification of logic, that protocol behavior of the simulated DRAM circuit is the same as the desired standard or other design specification. Examples of suitable verification tools include: Magellan, supplied by Synopsys, Inc. of 700 E. Middlefield Rd., Mt. View, Calif. 94043; Incisive, supplied by Cadence Design Systems, Inc., of 2655 Sealy Ave., San Jose, Calif. 95134; tools supplied by Jasper Design Automation, Inc. of 100 View St. #100, Mt. View, Calif. 94041; Verix, supplied by Real Intent, Inc., of 505 N. Mathilda Ave. #210, Sunnyvale, Calif. 94085; 0-In, supplied by Mentor Graphics Corp. of 8005 SW Boeckman Rd., Wilsonville, Oreg. 97070; and others. These software verification tools use written assertions that correspond to the rules established by the particular DRAM protocol and specification. These written assertions are further included in the code that forms the logic description for the buffer chip. By writing assertions that correspond to the desired behavior of the emulated DRAM circuit, a proof may be constructed that determines whether the desired design requirements are met.

For instance, an assertion may be written that no two DRAM control signals are allowed to be issued to an address, control, and clock bus at the same time. Although one may know which of the various buffer chip/DRAM stack configurations and address mappings (such as those described above) are suitable, the verification process allows a designer to prove that the emulated DRAM circuit exactly meets the required standard etc. If, for example, an address mapping that uses a common bus for data and a common bus for address, results in a control and clock bus that does not meet a required specification, alternative designs for buffer chips with other bus arrangements or alternative designs for the sideband signal interconnect between two or more buffer chips may be used and tested for compliance. Such sideband signals convey the power management signals, for example.

FIG. 8 illustrates a high capacity DIMM 800 using a plurality of buffered stacks of DRAM circuits 802 and a register device 804, according to one embodiment of this invention. The register performs the addressing and control of the buffered stacks. In some embodiments, the DIMM may be an FB-DIMM, in which case the register is an AMB. In one embodiment the emulation is performed at the DIMM level.

FIG. 9 is a timing diagram illustrating a timing design 900 of a buffer chip which makes a buffered stack of DRAM chips mimic a larger DRAM circuit having longer CAS latency, in accordance with another embodiment of this invention. Any delay through a buffer chip may be made transparent to the host system's memory controller, by using such a method. Such a delay may be a result of the buffer chip being located electrically between the memory bus of the host system and the stacked DRAM circuits, since some or all of the signals that connect the memory bus to the DRAM circuits pass through the buffer chip. A finite amount of time may be needed for these signals to traverse through the buffer chip. With the exception of register chips and AMBs, industry standard memory protocols may not comprehend the buffer chip that sits between the memory bus and the DRAM chips. Industry standards narrowly define the properties of a register chip and an AMB, but not the properties of the buffer chip of this embodiment. Thus, any signal delay caused by the buffer chip may cause a violation of the industry standard protocols.

In one embodiment, the buffer chip may cause a one-half clock cycle delay between the buffer chip receiving address and control signals from the host system memory controller (or, optionally, from a register chip or an AMB), and the address and control signals being valid at the inputs of the stacked DRAM circuits. Data signals may also have a one-half clock cycle delay in either direction to/from the host system. Other amounts of delay are, of course, possible, and the half-clock cycle example is for illustration only.

The cumulative delay through the buffer chip is the sum of a delay of the address and control signals and a delay of the data signals. FIG. 9 illustrates an example where the buffer chip is using DRAM chips having a native CAS latency of i clocks, and the buffer chip delay is j clocks, thus the buffer chip emulates a DRAM having a CAS latency of i+j clocks. In the example shown, the DRAM chips have a native CAS latency 906 of four clocks (from t1 to t5), and the total latency through the buffer chip is two clocks (one clock delay 902 from t0 to t1 for address and control signals, plus one clock delay 904 from t5 to t6 for data signals), and the buffer chip emulates a DRAM having a six clock CAS latency 908.

In FIG. 9 (and other timing diagrams), the reference numbers 1, 2, and/or 3 at the left margin indicate which of the interfaces correspond to the signals or values illustrated on the associated waveforms. For example, in FIG. 9: the “Clock” signal shown as a square wave on the uppermost waveform is indicated as belonging to the interface 1 between the host system and the buffer chip; the “Control Input to Buffer” signal is also part of the interface 1; the “Control Input to DRAM” waveform is part of the interface 2 from the buffer chip to the physical memory circuits; the “Data Output from DRAM” waveform is part of the interface 3 from the physical memory circuits to the buffer chip; and the “Data Output from Buffer” shown in the lowermost waveform is part of the interface 1 from the buffer chip to the host system.

FIG. 10 is a timing diagram illustrating a timing design 1000 of write data timing expected by a DRAM circuit in a buffered stack. Emulation of a larger capacity DRAM circuit having higher CAS latency (as in FIG. 9) may, in some implementations, create a problem with the timing of write operations. For example, with respect to a buffered stack of DDR2 SDRAM chips with a read CAS latency of four clocks which are used in emulating a single larger DDR2 SDRAM with a read CAS latency of six clocks, the DDR2 SDRAM protocol may specify that the write CAS latency 1002 is one less than the read CAS latency. Therefore, since the buffered stack appears as a DDR2 SDRAM with a read CAS latency of six clocks, the memory controller may use a buffered stack write CAS latency of five clocks 1004 when scheduling a write operation to the memory.

In the specific example shown, the memory controller issues the write operation at t0. After a one clock cycle delay through the buffer chip, the write operation is issued to the DRAM chips at t1. Because the memory controller believes it is connected to memory having a read CAS latency of six clocks and thus a write CAS latency of five clocks, it issues the write data at time t0+5=t5. But because the physical DRAM chips have a read CAS latency of four clocks and thus a write CAS latency of three clocks, they expect to receive the write data at time t1+3=t4. Hence the problem, which the buffer chip may alleviate by delaying write operations.

The waveform “Write Data Expected by DRAM” is not shown as belonging to interface 1, interface 2, or interface 3, for the simple reason that there is no such signal present in any of those interfaces. That waveform represents only what is expected by the DRAM, not what is actually provided to the DRAM.

FIG. 11 is a timing illustrating a timing design 1100 showing how the buffer chip does this. The memory controller issues the write operation at t0. In FIG. 10, the write operation appeared at the DRAM circuits one clock later at t1, due to the inherent delay through the buffer chip. But in FIG. 11, in addition to the inherent one clock delay, the buffer chip has added an extra two clocks of delay to the write operation, which is not issued to the DRAM chips until t0+1+2=t3. Because the DRAM chips receive the write operation at t3 and have a write CAS latency of three clocks, they expect to receive the write data at t3+3=t6. Because the memory controller issued the write operation at t0, and it expects a write CAS latency of five clocks, it issues the write data at time t0+5=t5. After a one clock delay through the buffer chip, the write data arrives at the DRAM chips at t5+1=t6, and the timing problem is solved.

It should be noted that extra delay of j clocks (beyond the inherent delay) which the buffer chip deliberately adds before issuing the write operation to the DRAM is the sum j clocks of the inherent delay of the address and control signals and the inherent delay of the data signals. In the example shown, both those inherent delays are one clock, so j=2.

FIG. 12 is a timing diagram illustrating operation of an FB-DIMM's AMB, which may be designed to send write data earlier to buffered stacks instead of delaying the write address and operation (as in FIG. 11). Specifically, it may use an early write CAS latency 1202 to compensate the timing of the buffer chip write operation. If the buffer chip has a cumulative (address and data) inherent delay of two clocks, the AMB may send the write data to the buffered stack two clocks early. This may not be possible in the case of registered DIMMs, in which the memory controller sends the write data directly to the buffered stacks (rather than via the AMB). In another embodiment, the memory controller itself could be designed to send write data early, to compensate for the j clocks of cumulative inherent delay caused by the buffer chip.

In the example shown, the memory controller issues the write operation at t0. After a one clock inherent delay through the buffer chip, the write operation arrives at the DRAM at t1. The DRAM expects the write data at t1+3=t4. The industry specification would suggest a nominal write data time of t0+5=t5, but the AMB (or memory controller), which already has the write data (which are provided with the write operation), is configured to perform an early write at t5−2=t3. After the inherent delay 1203 through the buffer chip, the write data arrive at the DRAM at t3+1=t4, exactly when the DRAM expects it—specifically, with a three-cycle DRAM Write CAS latency 1204 which is equal to the three-cycle Early Write CAS Latency 1202.

FIG. 13 is a timing diagram 1300 illustrating bus conflicts which can be caused by delayed write operations. The delaying of write addresses and write operations may be performed by a buffer chip, a register, an AMB, etc. in a manner that is completely transparent to the memory controller of the host system. And, because the memory controller is unaware of this delay, it may schedule subsequent operations such as activate or precharge operations, which may collide with the delayed writes on the address bus to the DRAM chips in the stack.

An example is shown, in which the memory controller issues a write operation 1302 at time t0. The buffer chip or AMB delays the write operation, such that it appears on the bus to the DRAM chips at time t3. Unfortunately, at time t2 the memory controller issued an activate operation (control signal) 1304 which, after a one-clock inherent delay through the buffer chip, appears on the bus to the DRAM chips at time t3, colliding with the delayed write.

FIGS. 14 and 15 are a timing diagram 1400 and a timing diagram 1500 illustrating methods of avoiding such collisions. If the cumulative latency through the buffer chip is two clock cycles, and the native read CAS latency of the DRAM chips is four clock cycles, then in order to hide the delay of the address and control signals and the data signals through the buffer chip, the buffer chip presents the host system with an interface to an emulated memory having a read CAS latency of six clock cycles. And if the tRCD and tRP of the DRAM chips are four clock cycles each, the buffer chip tells the host system that they are six clock cycles each in order to allow the buffer chip to delay the activate and precharge operations to avoid collisions in a manner that is transparent to the host system.

For example, a buffered stack that uses 4-4-4 DRAM chips (that is, CAS latency=4, tRCD=4, and tRP=4) may appear to the host system as one larger DRAM that uses 6-6-6 timing.

Since the buffered stack appears to the host system's memory controller as having a tRCD of six clock cycles, the memory controller may schedule a column operation to a bank six clock cycles (at time t6) after an activate (row) operation (at time t0) to the same bank. However, the DRAM chips in the stack actually have a tRCD of four clock cycles. This gives the buffer chip time to delay the activate operation by up to two clock cycles, avoiding any conflicts on the address bus between the buffer chip and the DRAM chips, while ensuring correct read and write timing on the channel between the memory controller and the buffered stack.

As shown, the buffer chip may issue the activate operation to the DRAM chips one, two, or three clock cycles after it receives the activate operation from the memory controller, register, or AMB. The actual delay selected may depend on the presence or absence of other DRAM operations that may conflict with the activate operation, and may optionally change from one activate operation to another. In other words, the delay may be dynamic. A one-clock delay (1402A, 1502A) may be accomplished simply by the inherent delay through the buffer chip. A two-clock delay (1402B, 1502B) may be accomplished by adding one clock of additional delay to the one-clock inherent delay, and a three-clock delay (1402C, 1502C) may be accomplished by adding two clocks of additional delay to the one-clock inherent delay. A read, write, or activate operation issued by the memory controller at time t6 will, after a one-clock inherent delay through the buffer chip, be issued to the DRAM chips at time t7. A preceding activate or precharge operation issued by the memory controller at time t0 will, depending upon the delay, be issued to the DRAM chips at time t1, t2, or t3, each of which is at least the tRCD or tRP of four clocks earlier than the t7 issuance of the read, write, or activate operation.

Since the buffered stack appears to the memory controller to have a tRP of six clock cycles, the memory controller may schedule a subsequent activate (row) operation to a bank a minimum of six clock cycles after issuing a precharge operation to that bank. However, since the DRAM circuits in the stack actually have a tRP of four clock cycles, the buffer chip may have the ability to delay issuing the precharge operation to the DRAM chips by up to two clock cycles, in order to avoid any conflicts on the address bus, or in order to satisfy the tRAS requirements of the DRAM chips.

In particular, if the activate operation to a bank was delayed to avoid an address bus conflict, then the precharge operation to the same bank may be delayed by the buffer chip to satisfy the tRAS requirements of the DRAM. The buffer chip may issue the precharge operation to the DRAM chips one, two, or three clock cycles after it is received. The delay selected may depend on the presence or absence of address bus conflicts or tRAS violations, and may change from one precharge operation to another.

FIG. 16 illustrates a buffered stack 1600 according to one embodiment of this invention. The buffered stack includes four 512 Mb DDR2 DRAM circuits (chips) 1602 which a buffer chip 1604 maps to a single 2 Gb DDR2 DRAM.

Although the multiple DRAM chips appear to the memory controller as though they were a single, larger DRAM, the combined power dissipation of the actual DRAM chips may be much higher than the power dissipation of a monolithic DRAM of the same capacity. In other words, the physical DRAM may consume significantly more power than would be consumed by the emulated DRAM.

As a result, a DIMM containing multiple buffered stacks may dissipate much more power than a standard DIMM of the same actual capacity using monolithic DRAM circuits. This increased power dissipation may limit the widespread adoption of DIMMs that use buffered stacks. Thus, it is desirable to have a power management technique which reduces the power dissipation of DIMMs that use buffered stacks.

In one such technique, the DRAM circuits may be opportunistically placed in low power states or modes. For example, the DRAM circuits may be placed in a precharge power down mode using the clock enable (CKE) pin of the DRAM circuits.

A single rank registered DIMM (R-DIMM) may contain a plurality of buffered stacks, each including four x4 512 Mb DDR2 SDRAM chips and appear (to the memory controller via emulation by the buffer chip) as a single x4 2 Gb DDR2 SDRAM. The JEDEC standard indicates that a 2 Gb DDR2 SDRAM may generally have eight banks, shown in FIG. 16 as Bank 0 to Bank 7. Therefore, the buffer chip may map each 512 Mb DRAM chip in the stack to two banks of the equivalent 2 Gb DRAM, as shown; the first DRAM chip 1602A is treated as containing banks 0 and 1, 1602B is treated as containing banks 2 and 4, and so forth.

The memory controller may open and close pages in the DRAM banks based on memory requests it receives from the rest of the host system. In some embodiments, no more than one page may be able to be open in a bank at any given time. In the embodiment shown in FIG. 16, each DRAM chip may therefore have up to two pages open at a time. When a DRAM chip has no open pages, the power management scheme may place it in the precharge power down mode.

The clock enable inputs of the DRAM chips may be controlled by the buffer chip, or by another chip (not shown) on the R-DIMM, or by an AMB (not shown) in the case of an FB-DIMM, or by the memory controller, to implement the power management technique. The power management technique may be particularly effective if it implements a closed page policy.

Another optional power management technique may include mapping a plurality of DRAM circuits to a single bank of the larger capacity emulated DRAM. For example, a buffered stack (not shown) of sixteen x4 256 Mb DDR2 SDRAM chips may be used in emulating a single x4 4 Gb DDR2 SDRAM. The 4 Gb DRAM is specified by JEDEC as having eight banks of 512 Mbs each, so two of the 256 Mb DRAM chips may be mapped by the buffer chip to emulate each bank (whereas in FIG. 16 one DRAM was used to emulate two banks).

However, since only one page can be open in a bank at any given time, only one of the two DRAM chips emulating that bank can be in the active state at any given time. If the memory controller opens a page in one of the two DRAM chips, the other may be placed in the precharge power down mode. Thus, if a number p of DRAM chips are used to emulate one bank, at least p−1 of them may be in a power down mode at any given time; in other words, at least p−1 of the p chips are always in power down mode, although the particular powered down chips will tend to change over time, as the memory controller opens and closes various pages of memory.

As a caveat on the term “always” in the preceding paragraph, the power saving operation may comprise operating in precharge power down mode except when refresh is required.

FIG. 17 is a flow chart 1700 illustrating one embodiment of a method of refreshing a plurality of memory circuits. A refresh control signal is received (1702) e.g. from a memory controller which intends to refresh an emulated memory circuit. In response to receipt of the refresh control signal, a plurality of refresh control signals are sent (1704) e.g. by a buffer chip to a plurality of physical memory circuits at different times. These refresh control signals may optionally include the received refresh control signal or an instantiation or copy thereof. They may also, or instead, include refresh control signals that are different in at least one aspect (format, content, etc.) from the received signal.

In some embodiments, at least one first refresh control signal may be sent to a first subset of the physical memory circuits at a first time, and at least one second refresh control signal may be sent to a second subset of the physical memory circuits at a second time. Each refresh signal may be sent to one physical memory circuit, or to a plurality of physical memory circuits, depending upon the particular implementation.

The refresh control signals may be sent to the physical memory circuits after a delay in accordance with a particular timing. For example, the timing in which they are sent to the physical memory circuits may be selected to minimize an electrical current drawn by the memory, or to minimize a power consumption of the memory. This may be accomplished by staggering a plurality of refresh control signals. Or, the timing may be selected to comply with e.g. a tRFC parameter associated with the memory circuits.

To this end, physical DRAM circuits may receive periodic refresh operations to maintain integrity of data stored therein. A memory controller may initiate refresh operations by issuing refresh control signals to the DRAM circuits with sufficient frequency to prevent any loss of data in the DRAM circuits. After a refresh control signal is issued, a minimum time tRFC may be required to elapse before another control signal may be issued to that DRAM circuit. The tRFC parameter value may increase as the size of the DRAM circuit increases.

When the buffer chip receives a refresh control signal from the memory controller, it may refresh the smaller DRAM circuits within the span of time specified by the tRFC of the emulated DRAM circuit. Since the tRFC of the larger, emulated DRAM is longer than the tRFC of the smaller, physical DRAM circuits, it may not be necessary to issue any or all of the refresh control signals to the physical DRAM circuits simultaneously. Refresh control signals may be issued separately to individual DRAM circuits or to groups of DRAM circuits, provided that the tRFC requirements of all physical DRAMs has been met by the time the emulated DRAM's tRFC has elapsed. In use, the refreshes may be spaced in time to minimize the peak current draw of the combination buffer chip and DRAM circuit set during a refresh operation.

FIG. 18 illustrates one embodiment of an interface circuit such as may be utilized in any of the above-described memory systems, for interfacing between a system and memory circuits. The interface circuit may be included in the buffer chip, for example.

The interface circuit includes a system address signal interface for sending/receiving address signals to/from the host system, a system control signal interface for sending/receiving control signals to/from the host system, a system clock signal interface for sending/receiving clock signals to/from the host system, and a system data signal interface for sending/receiving data signals to/from the host system. The interface circuit further includes a memory address signal interface for sending/receiving address signals to/from the physical memory, a memory control signal interface for sending/receiving control signals to/from the physical memory, a memory clock signal interface for sending/receiving clock signals to/from the physical memory, and a memory data signal interface for sending/receiving data signals to/from the physical memory.

The host system includes a set of memory attribute expectations, or built-in parameters of the physical memory with which it has been designed to work (or with which it has been told, e.g. by the buffer circuit, it is working). Accordingly, the host system includes a set of memory interaction attributes, or built-in parameters according to which the host system has been designed to operate in its interactions with the memory. These memory interaction attributes and expectations will typically, but not necessarily, be embodied in the host system's memory controller.

In addition to physical storage circuits or devices, the physical memory itself has a set of physical attributes.

These expectations and attributes may include, by way of example only, memory timing, memory capacity, memory latency, memory functionality, memory type, memory protocol, memory power consumption, memory current requirements, and so forth.

The interface circuit includes memory physical attribute storage for storing values or parameters of various physical attributes of the physical memory circuits. The interface circuit further includes system emulated attribute storage. These storage systems may be read/write capable stores, or they may simply be a set of hard-wired logic or values, or they may simply be inherent in the operation of the interface circuit.

The interface circuit includes emulation logic which operates according to the stored memory physical attributes and the stored system emulation attributes, to present to the system an interface to an emulated memory which differs in at least one attribute from the actual physical memory. The emulation logic may, in various embodiments, alter a timing, value, latency, etc. of any of the address, control, clock, and/or data signals it sends to or receives from the system and/or the physical memory. Some such signals may pass through unaltered, while others may be altered. The emulation logic may be embodied as, for example, hard wired logic, a state machine, software executing on a processor, and so forth.

When one component is said to be “adjacent” another component, it should not be interpreted to mean that there is absolutely nothing between the two components, only that they are in the order indicated.

The physical memory circuits employed in practicing this invention may be any type of memory whatsoever, such as: DRAM, DDR DRAM, DDR2 DRAM, DDR3 DRAM, SDRAM, QDR DRAM, DRDRAM, FPM DRAM, VDRAM, EDO DRAM, BEDO DRAM, MDRAM, SGRAM, MRAM, IRAM, NAND flash, NOR flash, PSRAM, wetware memory, etc.

The physical memory circuits may be coupled to any type of memory module, such as: DIMM, R-DIMM, SO-DIMM, FB-DIMM, unbuffered DIMM, etc.

The system device which accesses the memory may be any type of system device, such as: desktop computer, laptop computer, workstation, server, consumer electronic device, television, personal digital assistant (PDA), mobile phone, printer or other peripheral device, etc.

Power-Related Embodiments

FIG. 19 illustrates a multiple memory circuit framework 1900, in accordance with one embodiment. As shown, included are an interface circuit 1902, a plurality of memory circuits 1904A, 1904B, 1904N, and a system 1906. In the context of the present description, such memory circuits 1904A, 1904B, 1904N may include any circuit capable of serving as memory.

For example, in various embodiments, at least one of the memory circuits 1904A, 1904B, 1904N may include a monolithic memory circuit, a semiconductor die, a chip, a packaged memory circuit, or any other type of tangible memory circuit. In one embodiment, the memory circuits 1904A, 1904B, 1904N may take the form of a dynamic random access memory (DRAM) circuit. Such DRAM may take any form including, but not limited to, synchronous DRAM (SDRAM), double data rate synchronous DRAM (DDR SDRAM, DDR2 SDRAM, DDR3 SDRAM, etc.), graphics double data rate DRAM (GDDR, GDDR2, GDDR3, etc.), quad data rate DRAM (QDR DRAM), RAMBUS XDR DRAM (XDR DRAM), fast page mode DRAM (FPM DRAM), video DRAM (VDRAM), extended data out DRAM (EDO DRAM), burst EDO RAM (BEDO DRAM), multibank DRAM (MDRAM), synchronous graphics RAM (SGRAM), and/or any other type of DRAM.

In another embodiment, at least one of the memory circuits 1904A, 1904B, 1904N may include magnetic random access memory (MRAM), intelligent random access memory (IRAM), distributed network architecture (DNA) memory, window random access memory (WRAM), flash memory (e.g. NAND, NOR, etc.), pseudostatic random access memory (PSRAM), wetware memory, memory based on semiconductor, atomic, molecular, optical, organic, biological, chemical, or nanoscale technology, and/or any other type of volatile or nonvolatile, random or non-random access, serial or parallel access memory circuit.

Strictly as an option, the memory circuits 1904A, 1904B, 1904N may or may not be positioned on at least one dual in-line memory module (DIMM) (not shown). In various embodiments, the DIMM may include a registered DIMM (R-DIMM), a small outline-DIMM (SO-DIMM), a fully buffered DIMM (FB-DIMM), an unbuffered DIMM (UDIMM), single inline memory module (SIMM), a MiniDIMM, a very low profile (VLP) R-DIMM, etc. In other embodiments, the memory circuits 1904A, 1904B, 1904N may or may not be positioned on any type of material forming a substrate, card, module, sheet, fabric, board, carrier or other any other type of solid or flexible entity, form, or object. Of course, in other embodiments, the memory circuits 1904A, 1904B, 1904N may or may not be positioned in or on any desired entity, form, or object for packaging purposes. Still yet, the memory circuits 1904A, 1904B, 1904N may or may not be organized into ranks Such ranks may refer to any arrangement of such memory circuits 1904A, 1904B, 1904N on any of the foregoing entities, forms, objects, etc.

Further, in the context of the present description, the system 1906 may include any system capable of requesting and/or initiating a process that results in an access of the memory circuits 1904A, 1904B, 1904N. As an option, the system 1906 may accomplish this utilizing a memory controller (not shown), or any other desired mechanism. In one embodiment, such system 1906 may include a system in the form of a desktop computer, a lap-top computer, a server, a storage system, a networking system, a workstation, a personal digital assistant (PDA), a mobile phone, a television, a computer peripheral (e.g. printer, etc.), a consumer electronics system, a communication system, and/or any other software and/or hardware, for that matter.

The interface circuit 1902 may, in the context of the present description, refer to any circuit capable of interfacing (e.g. communicating, buffering, etc.) with the memory circuits 1904A, 1904B, 1904N and the system 1906. For example, the interface circuit 1902 may, in the context of different embodiments, include a circuit capable of directly (e.g. via wire, bus, connector, and/or any other direct communication medium, etc.) and/or indirectly (e.g. via wireless, optical, capacitive, electric field, magnetic field, electromagnetic field, and/or any other indirect communication medium, etc.) communicating with the memory circuits 1904A, 1904B, 1904N and the system 1906. In additional different embodiments, the communication may use a direct connection (e.g. point-to-point, single-drop bus, multi-drop bus, serial bus, parallel bus, link, and/or any other direct connection, etc.) or may use an indirect connection (e.g. through intermediate circuits, intermediate logic, an intermediate bus or busses, and/or any other indirect connection, etc.).

In additional optional embodiments, the interface circuit 1902 may include one or more circuits, such as a buffer (e.g. buffer chip, etc.), register (e.g. register chip, etc.), advanced memory buffer (AMB) (e.g. AMB chip, etc.), a component positioned on at least one DIMM, etc. Moreover, the register may, in various embodiments, include a JEDEC Solid State Technology Association (known as JEDEC) standard register (a JEDEC register), a register with forwarding, storing, and/or buffering capabilities, etc. In various embodiments, the register chips, buffer chips, and/or any other interface circuit(s) 1902 may be intelligent, that is, include logic that are capable of one or more functions such as gathering and/or storing information; inferring, predicting, and/or storing state and/or status; performing logical decisions; and/or performing operations on input signals, etc. In still other embodiments, the interface circuit 1902 may optionally be manufactured in monolithic form, packaged form, printed form, and/or any other manufactured form of circuit, for that matter.

In still yet another embodiment, a plurality of the aforementioned interface circuits 1902 may serve, in combination, to interface the memory circuits 1904A, 1904B, 1904N and the system 1906. Thus, in various embodiments, one, two, three, four, or more interface circuits 1902 may be utilized for such interfacing purposes. In addition, multiple interface circuits 1902 may be relatively configured or connected in any desired manner. For example, the interface circuits 1902 may be configured or connected in parallel, serially, or in various combinations thereof. The multiple interface circuits 1902 may use direct connections to each other, indirect connections to each other, or even a combination thereof. Furthermore, any number of the interface circuits 1902 may be allocated to any number of the memory circuits 1904A, 1904B, 1904N. In various other embodiments, each of the plurality of interface circuits 1902 may be the same or different. Even still, the interface circuits 1902 may share the same or similar interface tasks and/or perform different interface tasks.

While the memory circuits 1904A, 1904B, 1904N, interface circuit 1902, and system 1906 are shown to be separate parts, it is contemplated that any of such parts (or portion(s) thereof) may be integrated in any desired manner. In various embodiments, such optional integration may involve simply packaging such parts together (e.g. stacking the parts to form a stack of DRAM circuits, a DRAM stack, a plurality of DRAM stacks, a hardware stack, where a stack may refer to any bundle, collection, or grouping of parts and/or circuits, etc.) and/or integrating them monolithically. Just by way of example, in one optional embodiment, at least one interface circuit 1902 (or portion(s) thereof) may be packaged with at least one of the memory circuits 1904A, 1904B, 1904N. Thus, a DRAM stack may or may not include at least one interface circuit (or portion(s) thereof). In other embodiments, different numbers of the interface circuit 1902 (or portion(s) thereof) may be packaged together. Such different packaging arrangements, when employed, may optionally improve the utilization of a monolithic silicon implementation, for example.

The interface circuit 1902 may be capable of various functionality, in the context of different embodiments. For example, in one optional embodiment, the interface circuit 1902 may interface a plurality of signals 1908 that are connected between the memory circuits 1904A, 1904B, 1904N and the system 1906. The signals may, for example, include address signals, data signals, control signals, enable signals, clock signals, reset signals, or any other signal used to operate or associated with the memory circuits, system, or interface circuit(s), etc. In some optional embodiments, the signals may be those that: use a direct connection, use an indirect connection, use a dedicated connection, may be encoded across several connections, and/or may be otherwise encoded (e.g. time-multiplexed, etc.) across one or more connections.

In one aspect of the present embodiment, the interfaced signals 1908 may represent all of the signals that are connected between the memory circuits 1904A, 1904B, 1904N and the system 1906. In other aspects, at least a portion of signals 1910 may use direct connections between the memory circuits 1904A, 1904B, 1904N and the system 1906. Moreover, the number of interfaced signals 1908 (e.g. vs. a number of the signals that use direct connections 1910, etc.) may vary such that the interfaced signals 1908 may include at least a majority of the total number of signal connections between the memory circuits 1904A, 1904B, 1904N and the system 1906 (e.g. L>M, with L and M as shown in FIG. 19). In other embodiments, L may be less than or equal to M. In still other embodiments L and/or M may be zero.

In yet another embodiment, the interface circuit 1902 may or may not be operable to interface a first number of memory circuits 1904A, 1904B, 1904N and the system 1906 for simulating a second number of memory circuits to the system 1906. The first number of memory circuits 1904A, 1904B, 1904N shall hereafter be referred to, where appropriate for clarification purposes, as the “physical” memory circuits or memory circuits, but are not limited to be so. Just by way of example, the physical memory circuits may include a single physical memory circuit. Further, the at least one simulated memory circuit seen by the system 1906 shall hereafter be referred to, where appropriate for clarification purposes, as the at least one “virtual” memory circuit.

In still additional aspects of the present embodiment, the second number of virtual memory circuits may be more than, equal to, or less than the first number of physical memory circuits 1904A, 1904B, 1904N. Just by way of example, the second number of virtual memory circuits may include a single memory circuit. Of course, however, any number of memory circuits may be simulated.

In the context of the present description, the term simulated may refer to any simulating, emulating, disguising, transforming, modifying, changing, altering, shaping, converting, etc., that results in at least one aspect of the memory circuits 1904A, 1904B, 1904N appearing different to the system 1906. In different embodiments, such aspect may include, for example, a number, a signal, a memory capacity, a timing, a latency, a design parameter, a logical interface, a control system, a property, a behavior (e.g. power behavior including, but not limited to a power consumption, current consumption, current waveform, power parameters, power metrics, any other aspect of power management or behavior, etc.), and/or any other aspect, for that matter.

In different embodiments, the simulation may be electrical in nature, logical in nature, protocol in nature, and/or performed in any other desired manner. For instance, in the context of electrical simulation, a number of pins, wires, signals, etc. may be simulated. In the context of logical simulation, a particular function or behavior may be simulated. In the context of protocol, a particular protocol (e.g. DDR3, etc.) may be simulated. Further, in the context of protocol, the simulation may effect conversion between different protocols (e.g. DDR2 and DDR3) or may effect conversion between different versions of the same protocol (e.g. conversion of 4-4-4 DDR2 to 6-6-6 DDR2).

During use, in accordance with one optional power management embodiment, the interface circuit 1902 may or may not be operable to interface the memory circuits 1904A, 1904B, 1904N and the system 1906 for simulating at least one virtual memory circuit, where the virtual memory circuit includes at least one aspect that is different from at least one aspect of one or more of the physical memory circuits 1904A, 1904B, 1904N. Such aspect may, in one embodiment, include power behavior (e.g. a power consumption, current consumption, current waveform, any other aspect of power management or behavior, etc.). Specifically, in such embodiment, the interface circuit 1902 is operable to interface the physical memory circuits 1904A, 1904B, 1904N and the system 1906 for simulating at least one virtual memory circuit with a first power behavior that is different from a second power behavior of the physical memory circuits 1904A, 1904B, 1904N. Such power behavior simulation may effect or result in a reduction or other modification of average power consumption, reduction or other modification of peak power consumption or other measure of power consumption, reduction or other modification of peak current consumption or other measure of current consumption, and/or modification of other power behavior (e.g. parameters, metrics, etc.). In one embodiment, such power behavior simulation may be provided by the interface circuit 1902 performing various power management.

In another power management embodiment, the interface circuit 1902 may perform a power management operation in association with only a portion of the memory circuits. In the context of the present description, a portion of memory circuits may refer to any row, column, page, bank, rank, sub-row, sub-column, sub-page, sub-bank, sub-rank, any other subdivision thereof, and/or any other portion or portions of one or more memory circuits. Thus, in an embodiment where multiple memory circuits exist, such portion may even refer to an entire one or more memory circuits (which may be deemed a portion of such multiple memory circuits, etc.). Of course, again, the portion of memory circuits may refer to any portion or portions of one or more memory circuits. This applies to both physical and virtual memory circuits.

In various additional power management embodiments, the power management operation may be performed by the interface circuit 1902 during a latency associated with one or more commands directed to at least a portion of the plurality of memory circuits 1904A, 1904B, 1904N. In the context of the present description, such command(s) may refer to any control signal (e.g. one or more address signals; one or more data signals; a combination of one or more control signals; a sequence of one or more control signals; a signal associated with an activate (or active) operation, precharge operation, write operation, read operation, a mode register write operation, a mode register read operation, a refresh operation, or other encoded or direct operation, command or control signal; etc.). In one optional embodiment where the interface circuit 1902 is further operable for simulating at least one virtual memory circuit, such virtual memory circuit(s) may include a first latency that is different than a second latency associated with at least one of the plurality of memory circuits 1904A, 1904B, 1904N. In use, such first latency may be used to accommodate the power management operation.

Yet another embodiment is contemplated where the interface circuit 1902 performs the power management operation in association with at least a portion of the memory circuits, in an autonomous manner. Such autonomous performance refers to the ability of the interface circuit 1902 to perform the power management operation without necessarily requiring the receipt of an associated power management command from the system 1906.

In still additional embodiments, interface circuit 1902 may receive a first number of power management signals from the system 1906 and may communicate a second number of power management signals that is the same or different from the first number of power management signals to at least a portion of the memory circuits 1904A, 1904B, 1904N. In the context of the present description, such power management signals may refer to any signal associated with power management, examples of which will be set forth hereinafter during the description of other embodiments. In still another embodiment, the second number of power management signals may be utilized to perform power management of the portion(s) of memory circuits in a manner that is independent from each other and/or independent from the first number of power management signals received from the system 1906 (which may or may not also be utilized in a manner that is independent from each other). In even still yet another embodiment where the interface circuit 1902 is further operable for simulating at least one virtual memory circuit, a number of the aforementioned ranks (seen by the system 1906) may be less than the first number of power management signals.

In other power management embodiments, the interface circuit 1902 may be capable of a power management operation that takes the form of a power saving operation. In the context of the present description, the term power saving operation may refer to any operation that results in at least some power savings.

It should be noted that various power management operation embodiments, power management signal embodiments, simulation embodiments (and any other embodiments, for that matter) may or may not be used in conjunction with each other, as well as the various different embodiments that will hereinafter be described. To this end, more illustrative information will now be set forth regarding optional functionality/architecture of different embodiments which may or may not be implemented in the context of such interface circuit 1902 and the related components of FIG. 19, per the desires of the user. It should be strongly noted that the following information is set forth for illustrative purposes and should not be construed as limiting in any manner. For example, any of the following features may be optionally incorporated with or without the other features described.

Additional Power Management Embodiments

In one exemplary power management embodiment, the aforementioned simulation of a different power behavior may be achieved utilizing a power saving operation.

In one such embodiment, the power management, power behavior simulation, and thus the power saving operation may optionally include applying a power saving command to one or more memory circuits based on at least one state of one or more memory circuits. Such power saving command may include, for example, initiating a power down operation applied to one or more memory circuits. Further, such state may depend on identification of the current, past or predictable future status of one or more memory circuits, a predetermined combination of commands issued to the one or more memory circuits, a predetermined pattern of commands issued to the one or more memory circuits, a predetermined absence of commands issued to the one or more memory circuits, any command(s) issued to the one or more memory circuits, and/or any command(s) issued to one or more memory circuits other than the one or more memory circuits. In the context of the present description, such status may refer to any property of the memory circuit that may be monitored, stored, and/or predicted.

For example, at least one of a plurality of memory circuits may be identified that is not currently being accessed by the system. Such status identification may involve determining whether a portion(s) is being accessed in at least one of the plurality of memory circuits. Of course, any other technique may be used that results in the identification of at least one of the memory circuits (or portion(s) thereof) that is not being accessed, e.g. in a non-accessed state. In other embodiments, other such states may be detected or identified and used for power management.

In response to the identification of a memory circuit in a non-accessed state, a power saving operation may be initiated in association with the non-accessed memory circuit (or portion thereof). In one optional embodiment, such power saving operation may involve a power down operation (e.g. entry into a precharge power down mode, as opposed to an exit therefrom, etc.). As an option, such power saving operation may be initiated utilizing (e.g. in response to, etc.) a power management signal including, but not limited to a clock enable signal (CKE), chip select signal, in combination with other signals and optionally commands. In other embodiments, use of a non-power management signal (e.g. control signal, etc.) is similarly contemplated for initiating the power saving operation. Of course, however, it should be noted that anything that results in modification of the power behavior may be employed in the context of the present embodiment.

As mentioned earlier, the interface circuit may be operable to interface the memory circuits and the system for simulating at least one virtual memory circuit, where the virtual memory circuit includes at least one aspect that is different from at least one aspect of one or more of the physical memory circuits. In different embodiments, such aspect may include, for example, a signal, a memory capacity, a timing, a logical interface, etc. As an option, one or more of such aspects may be simulated for supporting a power management operation.

For example, the simulated timing, as described above, may include a simulated latency (e.g. time delay, etc.). In particular, such simulated latency may include a column address strobe (CAS) latency (e.g. a latency associated with accessing a column of data). Still yet, the simulated latency may include a row address to column address latency (tRCD). Thus, the latency may be that between the row address strobe (RAS) and CAS.

In addition, the simulated latency may include a row precharge latency (tRP). The tRP may include the latency to terminate access to an open row. Further, the simulated latency may include an activate to precharge latency (tRAS). The tRAS may include the latency between an activate operation and a precharge operation. Furthermore, the simulated latency may include a row cycle time (tRC). The tRC may include the latency between consecutive activate operations to the same bank of a DRAM circuit. In some embodiments, the simulated latency may include a read latency, write latency, or latency associated with any other operation(s), command(s), or combination or sequence of operations or commands. In other embodiments, the simulated latency may include simulation of any latency parameter that corresponds to the time between two events.

For example, in one exemplary embodiment using simulated latency, a first interface circuit may delay address and control signals for certain operations or commands by a clock cycles. In various embodiments where the first interface circuit is operating as a register or may include a register, a may not necessarily include the register delay (which is typically a one clock cycle delay through a JEDEC register). Also in the present exemplary embodiment, a second interface circuit may delay data signals by d clock cycles. It should be noted that the first and second interface circuits may be the same or different circuits or components in various embodiments. Further, the delays a and d may or may not be different for different memory circuits. In other embodiments, the delays a and d may apply to address and/or control and/or data signals. In alternative embodiments, the delays a and d may not be integer or even constant multiples of the clock cycle and may be less than one clock cycle or zero.

The cumulative delay through the interface circuits (e.g. the sum of the first delay a of the address and control signals through the first interface circuit and the second delay d of the data signals through the second interface circuit) may be j clock cycles (e.g. j=a+d). Thus, in a DRAM-specific embodiment, in order to make the delays a and d transparent to the memory controller, the interface circuits may make the stack of DRAM circuits appear to a memory controller (or any other component, system, or part(s) of a system) as one (or more) larger capacity virtual DRAM circuits with a read latency of i+j clocks, where i is the inherent read latency of the physical DRAM circuits.

To this end, the interface circuits may be operable for simulating at least one virtual memory circuit with a first latency that may be different (e.g. equal, longer, shorter, etc.) than a second latency of at least one of the physical memory circuits. The interface circuits may thus have the ability to simulate virtual DRAM circuits with a possibly different (e.g. increased, decreased, equal, etc.) read or other latency to the system, thus making transparent the delay of some or all of the address, control, clock, enable, and data signals through the interface circuits. This simulated aspect, in turn, may be used to accommodate power management of the DRAM circuits. More information regarding such use will be set forth hereinafter in greater detail during reference to different embodiments outlined in subsequent figures.

In still another embodiment, the interface circuit may be operable to receive a signal from the system and communicate the signal to at least one of the memory circuits after a delay. The signal may refer to one of more of a control signal, a data signal, a clock signal, an enable signal, a reset signal, a logical or physical signal, a combination or pattern of such signals, or a sequence of such signals, and/or any other signal for that matter. In various embodiments, such delay may be fixed or variable (e.g. a function of a current signal, and/or a previous signal, and/or a signal that will be communicated, after a delay, at a future time, etc.). In still other embodiments, the interface circuit may be operable to receive one or more signals from at least one of the memory circuits and communicate the signal(s) to the system after a delay.

As an option, the signal delay may include a cumulative delay associated with one or more of the aforementioned signals. Even still, the signal delay may result in a time shift of the signal (e.g. forward and/or back in time) with respect to other signals. Of course, such forward and backward time shift may or may not be equal in magnitude.

In one embodiment, the time shifting may be accomplished utilizing a plurality of delay functions which each apply a different delay to a different signal. In still additional embodiments, the aforementioned time shifting may be coordinated among multiple signals such that different signals are subject to shifts with different relative directions/magnitudes. For example, such time shifting may be performed in an organized manner. Yet again, more information regarding such use of delay in the context of power management will be set forth hereinafter in greater detail during reference to subsequent figures.

Embodiments with Varying Physical Stack Arrangements

FIGS. 20A-E show a stack of DRAM circuits 2000 that utilize one or more interface circuits, in accordance with various embodiments. As an option, the stack of DRAM circuits 2000 may be implemented in the context of the architecture of FIG. 19. Of course, however, the stack of DRAM circuits 2000 may be implemented in any other desired environment (e.g. using other memory types, using different memory types within a stack, etc.). It should also be noted that the aforementioned definitions may apply during the present description.

As shown in FIGS. 20A-E, one or more interface circuits 2002 may be placed electrically between an electronic system 2004 and a stack of DRAM circuits 2006A-D. Thus the interface circuits 2002 electrically sit between the electronic system 2004 and the stack of DRAM circuits 2006A-D. In the context of the present description, the interface circuit(s) 2002 may include any interface circuit that meets the definition set forth during reference to FIG. 19.

In the present embodiment, the interface circuit(s) 2002 may be capable of interfacing (e.g. buffering, etc.) the stack of DRAM circuits 2006A-D to electrically and/or logically resemble at least one larger capacity virtual DRAM circuit to the system 2004. Thus, a stack or buffered stack may be utilized. In this way, the stack of DRAM circuits 2006A-D may appear as a smaller quantity of larger capacity virtual DRAM circuits to the system 2004.

Just by way of example, the stack of DRAM circuits 2006A-D may include eight 512 Mb DRAM circuits. Thus, the interface circuit(s) 2002 may buffer the stack of eight 512 Mb DRAM circuits to resemble a single 4 Gb virtual DRAM circuit to a memory controller (not shown) of the associated system 2004. In another example, the interface circuit(s) 2002 may buffer the stack of eight 512 Mb DRAM circuits to resemble two 2 Gb virtual DRAM circuits to a memory controller of an associated system 2004.

Furthermore, the stack of DRAM circuits 2006A-D may include any number of DRAM circuits. Just by way of example, the interface circuit(s) 2002 may be connected to 1, 2, 4, 8 or more DRAM circuits 2006A-D. In alternate embodiments, to permit data integrity storage or for other reasons, the interface circuit(s) 2002 may be connected to an odd number of DRAM circuits 2006A-D. Additionally, the DRAM circuits 2006A-D may be arranged in a single stack. Of course, however, the DRAM circuits 2006A-D may also be arranged in a plurality of stacks

The DRAM circuits 2006A-D may be arranged on, located on, or connected to a single side of the interface circuit(s) 2002, as shown in FIGS. 20A-D. As another option, the DRAM circuits 2006A-D may be arranged on, located on, or connected to both sides of the interface circuit(s) 2002 shown in FIG. 20E. Just by way of example, the interface circuit(s) 2002 may be connected to 16 DRAM circuits with 8 DRAM circuits on either side of the interface circuit(s) 2002, where the 8 DRAM circuits on each side of the interface circuit(s) 2002 are arranged in two stacks of four DRAM circuits. In other embodiments, other arrangements and numbers of DRAM circuits are possible (e.g. to implement error-correction coding, ECC, etc.)

The interface circuit(s) 2002 may optionally be a part of the stack of DRAM circuits 2006A-D. Of course, however, interface circuit(s) 2002 may also be separate from the stack of DRAM circuits 2006A-D. In addition, interface circuit(s) 2002 may be physically located anywhere in the stack of DRAM circuits 2006A-D, where such interface circuit(s) 2002 electrically sits between the electronic system 2004 and the stack of DRAM circuits 2006A-D.

In one embodiment, the interface circuit(s) 2002 may be located at the bottom of the stack of DRAM circuits 2006A-D (e.g. the bottom-most circuit in the stack) as shown in FIGS. 20A-2D. As another option, and as shown in FIG. 200E, the interface circuit(s) 2002 may be located in the middle of the stack of DRAM circuits 2006A-D. As still yet another option, the interface circuit(s) 2002 may be located at the top of the stack of DRAM circuits 2006A-D (e.g. the top-most circuit in the stack). Of course, however, the interface circuit(s) 2002 may also be located anywhere between the two extremities of the stack of DRAM circuits 2006A-D. In alternate embodiments, the interface circuit(s) 2002 may not be in the stack of DRAM circuits 2006A-D and may be located in a separate package(s).

The electrical connections between the interface circuit(s) 2002 and the stack of DRAM circuits 2006A-D may be configured in any desired manner. In one optional embodiment, address, control (e.g. command, etc.), and clock signals may be common to all DRAM circuits 2006A-D in the stack (e.g. using one common bus). As another option, there may be multiple address, control and clock busses.

As yet another option, there may be individual address, control and clock busses to each DRAM circuit 2006A-D. Similarly, data signals may be wired as one common bus, several busses, or as an individual bus to each DRAM circuit 2006A-D. Of course, it should be noted that any combinations of such configurations may also be utilized.

For example, as shown in FIG. 20A, the DRAM circuits 2006A-D may have one common address, control and clock bus 2008 with individual data busses 2010. In another example, as shown in FIG. 20B, the DRAM circuits 2006A-D may have two address, control and clock busses 2008 along with two data busses 2010. In still yet another example, as shown in FIG. 20C, the DRAM circuits 2006A-D may have one address, control and clock bus 2008 together with two data busses 2010. In addition, as shown in FIG. 20D, the DRAM circuits 2006A-D may have one common address, control and clock bus 2008 and one common data bus 2010. It should be noted that any other permutations and combinations of such address, control, clock and data buses may be utilized.

In one embodiment, the interface circuit(s) 2002 may be split into several chips that, in combination, perform power management functions. Such power management functions may optionally introduce a delay in various signals.

For example, there may be a single register chip that electrically sits between a memory controller and a number of stacks of DRAM circuits. The register chip may, for example, perform the signaling to the DRAM circuits. Such register chip may be connected electrically to a number of other interface circuits that sit electrically between the register chip and the stacks of DRAM circuits. Such interface circuits in the stacks of DRAM circuits may then perform the aforementioned delay, as needed.

In another embodiment, there may be no need for an interface circuit in each DRAM stack. In particular, the register chip may perform the signaling to the DRAM circuits directly. In yet another embodiment, there may be no need for a stack of DRAM circuits. Thus each stack may be a single memory (e.g. DRAM) circuit. In other implementations, combinations of the above implementations may be used. Just by way of example, register chips may be used in combination with other interface circuits, or registers may be utilized alone.

More information regarding the verification that a simulated DRAM circuit including any address, data, control and clock configurations behaves according to a desired DRAM standard or other design specification will be set forth hereinafter in greater detail.

Additional Embodiments with Different Physical Memory module arrangements

FIGS. 21A-D show a memory module 2100 which uses DRAM circuits or stacks of DRAM circuits (e.g. DRAM stacks) with various interface circuits, in accordance with different embodiments. As an option, the memory module 2100 may be implemented in the context of the architecture and environment of FIGS. 19 and/or 20. Of course, however, the memory module 2100 may be implemented in any desired environment. It should also be noted that the aforementioned definitions may apply during the present description.

FIG. 21A shows two register chips 2104 driving address and control signals to DRAM circuits 2102. The DRAM circuits 2102 may send/receive data signals to and/or from a system (e.g. memory controller) using the DRAM data bus, as shown.

FIG. 21B shows one register chip 2104 driving address and control signals to DRAM circuits 2102. Thus, one, two, three, or more register chips 2104 may be utilized, in various embodiments.

FIG. 21C shows register chips 2104 driving address and control signals to DRAM circuits 2102 and/or intelligent interface circuits 2103. In addition, the DRAM data bus is connected to the intelligent interface circuits 2103 (not shown explicitly). Of course, as described herein, and illustrated in FIGS. 21A and 21B, one, two, three or more register chips 2104 may be used. Furthermore, this FIG. illustrates that the register chip(s) 2104 may drive some, all, or none of the control and/or address signals to intelligent interface circuits 2103.

FIG. 21D shows register chips 2104 driving address and control signals to the DRAM circuits 2102 and/or intelligent interface circuits 2103. Furthermore, this FIG. illustrates that the register chip(s) 2104 may drive some, all, or none of the control and/or address signals to intelligent interface circuits 2103. Again, the DRAM data bus is connected to the intelligent interface circuits 2103. Additionally, this FIG. illustrates that either one (in the case of DRAM stack 2106) or two (in the case of the other DRAM stacks 2102) stacks of DRAM circuits 2102 may be associated with a single intelligent interface circuit 2103.

Of course, however, any number of stacks of DRAM circuits 2102 may be associated with each intelligent interface circuit 2103. As another option, an AMB chip may be utilized with an FB-DIMM, as will be described in more detail with respect to FIGS. 22A-E.

FIGS. 22A-E show a memory module 2200 which uses DRAM circuits or stacks of DRAM circuits (e.g. DRAM stacks) 2202 with an AMB chip 2204, in accordance with various embodiments. As an option, the memory module 2200 may be implemented in the context of the architecture and environment of FIGS. 19-21. Of course, however, the memory module 2200 may be implemented in any desired environment. It should also be noted that the aforementioned definitions may apply during the present description.

FIG. 22A shows the AMB chip 2204 driving address and control signals to the DRAM circuits 2202. In addition, the AMB chip 2204 sends/receives data to/from the DRAM circuits 2202.

FIG. 22B shows the AMB chip 2204 driving address and control signals to a register 2206. In turn, the register 2206 may drive address and control signals to the DRAM circuits 2202. The DRAM circuits send/receive data to/from the AMB. Moreover, a DRAM data bus may be connected to the AMB chip 2204.

FIG. 22C shows the AMB chip 2204 driving address and control to the register 2206. In turn, the register 2206 may drive address and control signals to the DRAM circuits 2202 and/or the intelligent interface circuits 2203. This FIG. illustrates that the register 2206 may drive zero, one, or more address and/or control signals to one or more intelligent interface circuits 2203. Further, each DRAM data bus is connected to the interface circuit 2203 (not shown explicitly). The intelligent interface circuit data bus is connected to the AMB chip 2204. The AMB data bus is connected to the system.

FIG. 22D shows the AMB chip 2204 driving address and/or control signals to the DRAM circuits 2202 and/or the intelligent interface circuits 2203. This FIG. illustrates that the AMB chip 2204 may drive zero, one, or more address and/or control signals to one or more intelligent interface circuits 2203. Moreover, each DRAM data bus is connected to the intelligent interface circuits 2203 (not shown explicitly). The intelligent interface circuit data bus is connected to the AMB chip 2204. The AMB data bus is connected to the system.

FIG. 22E shows the AMB chip 2204 driving address and control to one or more intelligent interface circuits 2203. The intelligent interface circuits 2203 then drive address and control to each DRAM circuit 2202 (not shown explicitly). Moreover, each DRAM data bus is connected to the intelligent interface circuits 2203 (also not shown explicitly). The intelligent interface circuit data bus is connected to the AMB chip 2204. The AMB data bus is connected to the system.

In other embodiments, combinations of the above implementations as shown in FIGS. 22A-E may be utilized. Just by way of example, one or more register chips may be utilized in conjunction with the intelligent interface circuits. In other embodiments, register chips may be utilized alone and/or with or without stacks of DRAM circuits.

FIG. 23 shows a system 2300 in which four 512 Mb DRAM circuits appear, through simulation, as (e.g. mapped to) a single 2 Gb virtual DRAM circuit, in accordance with yet another embodiment. As an option, the system 2300 may be implemented in the context of the architecture and environment of FIGS. 19-22. Of course, however, the system 2300 may be implemented in any desired environment. It should also be noted that the aforementioned definitions may apply during the present description.

As shown in FIG. 23, a stack of memory circuits that is interfaced by the interface circuit for the purpose of simulation (e.g. a buffered stack) may include four 512 Mb physical DRAM circuits 2302A-D that appear to a memory controller as a single 2 Gb virtual DRAM circuit. In different embodiments, the buffered stack may include various numbers of physical DRAM circuits including two, four, eight, sixteen or even more physical DRAM circuits that appear to the memory controller as a single larger capacity virtual DRAM circuit or multiple larger capacity virtual DRAM circuits. In addition, the number of physical DRAM circuits in the buffered stack may be an odd number. For example, an odd number of circuits may be used to provide data redundancy or data checking or other features.

Also, one or more control signals (e.g. power management signals) 2306 may be connected between the interface circuit 2304 and the DRAM circuits 2302A-D in the stack. The interface circuit 2304 may be connected to a control signal (e.g. power management signal) 2308 from the system, where the system uses the control signal 2308 to control one aspect (e.g. power behavior) of the 2 Gb virtual DRAM circuit in the stack. The interface circuit 2304 may control the one aspect (e.g. power behavior) of all the DRAM circuits 2302A-D in response to a control signal 2308 from the system to the 2 Gb virtual DRAM circuit. The interface circuit 2304 may also, using control signals 2306, control the one aspect (e.g. power behavior) of one or more of the DRAM circuits 2302A-D in the stack in the absence of a control signal 2308 from the system to the 2 Gb virtual DRAM circuit.

The buffered stacks 2300 may also be used in combination together on a DIMM such that the DIMM appears to the memory controller as a larger capacity DIMM. The buffered stacks may be arranged in one or more ranks on the DIMM. All the virtual DRAM circuits on the DIMM that respond in parallel to a control signal 2308 (e.g. chip select signal, clock enable signal, etc.) from the memory controller belong to a single rank. However, the interface circuit 2304 may use a plurality of control signals 2306 instead of control signal 2308 to control DRAM circuits 2302A-D. The interface circuit 2304 may use all the control signals 2306 in parallel in response to the control signal 2308 to do power management of the DRAM circuits 2302A-D in one example. In another example, the interface circuit 2304 may use at least one but not all the control signals 2306 in response to the control signal 2308 to do power management of the DRAM circuits 2302A-D. In yet another example, the interface circuit 2304 may use at least one control signal 2306 in the absence of the control signal 2308 to do power management of the DRAM circuits 2302A-D.

More information regarding the verification that a memory module including DRAM circuits with various interface circuits behave according to a desired DRAM standard or other design specification will be set forth hereinafter in greater detail.

DRAM Bank Configuration Embodiments

The number of banks per DRAM circuit may be defined by JEDEC standards for many DRAM circuit technologies. In various embodiments, there may be different configurations that use different mappings between the physical DRAM circuits in a stack and the banks in each virtual DRAM circuit seen by the memory controller. In each configuration, multiple physical DRAM circuits 2302A-D may be stacked and interfaced by an interface circuit 2304 and may appear as at least one larger capacity virtual DRAM circuit to the memory controller. Just by way of example, the stack may include four 512 Mb DDR2 physical SDRAM circuits that appear to the memory controller as a single 2 Gb virtual DDR2 SDRAM circuit.

In one optional embodiment, each bank of a virtual DRAM circuit seen by the memory controller may correspond to a portion of a physical DRAM circuit. That is, each physical DRAM circuit may be mapped to multiple banks of a virtual DRAM circuit. For example, in one embodiment, four 512 Mb DDR2 physical SDRAM circuits through simulation may appear to the memory controller as a single 2 Gb virtual DDR2 SDRAM circuit. A 2 Gb DDR2 SDRAM may have eight banks as specified by the JEDEC standards. Therefore, in this embodiment, the interface circuit 2304 may map each 512 Mb physical DRAM circuit to two banks of the 2 Gb virtual DRAM. Thus, in the context of the present embodiment, a one-circuit-to-many-bank configuration (one physical DRAM circuit to many banks of a virtual DRAM circuit) may be utilized.

In another embodiment, each physical DRAM circuit may be mapped to a single bank of a virtual DRAM circuit. For example, eight 512 Mb DDR2 physical SDRAM circuits may appear to the memory controller, through simulation, as a single 4 Gb virtual DDR2 SDRAM circuit. A 4 Gb DDR2 SDRAM may have eight banks as specified by the JEDEC standards. Therefore, the interface circuit 2304 may map each 512 Mb physical DRAM circuit to a single bank of the 4 Gb virtual DRAM. In this way, a one-circuit-to-one-bank configuration (one physical DRAM circuit to one bank of a virtual DRAM circuit) may be utilized.

In yet another embodiment, a plurality of physical DRAM circuits may be mapped to a single bank of a virtual DRAM circuit. For example, sixteen 256 Mb DDR2 physical SDRAM circuits may appear to the memory controller, through simulation, as a single 4 Gb virtual DDR2 SDRAM circuit. A 4 Gb DDR2 SDRAM circuit may be specified by JEDEC to have eight banks, such that each bank of the 4 Gb DDR2 SDRAM circuit may be 512 Mb. Thus, two of the 256 Mb DDR2 physical SDRAM circuits may be mapped by the interface circuit 2304 to a single bank of the 4 Gb virtual DDR2 SDRAM circuit seen by the memory controller. Accordingly, a many-circuit-to-one-bank configuration (many physical DRAM circuits to one bank of a virtual DRAM circuit) may be utilized.

Thus, in the above described embodiments, multiple physical DRAM circuits 2302A-D in the stack may be buffered by the interface circuit 2304 and may appear as at least one larger capacity virtual DRAM circuit to the memory controller. Just by way of example, the buffered stack may include four 512 Mb DDR2 physical SDRAM circuits that appear to the memory controller as a single 2 Gb DDR2 virtual SDRAM circuit. In normal operation, the combined power dissipation of all four DRAM circuits 2302A-D in the stack when they are active may be higher than the power dissipation of a monolithic (e.g. constructed without stacks) 2 Gb DDR2 SDRAM.

In general, the power dissipation of a DIMM constructed from buffered stacks may be much higher than a DIMM constructed without buffered stacks. Thus, for example, a DIMM containing multiple buffered stacks may dissipate much more power than a standard DIMM built using monolithic DRAM circuits. However, power management may be utilized to reduce the power dissipation of DIMMs that contain buffered stacks of DRAM circuits. Although the examples described herein focus on power management of buffered stacks of DRAM circuits, techniques and methods described apply equally well to DIMMs that are constructed without stacking the DRAM circuits (e.g. a stack of one DRAM circuit) as well as stacks that may not require buffering.

Embodiments Involving DRAM Power Management Latencies

In various embodiments, power management schemes may be utilized for one-circuit-to-many-bank, one-circuit-to-one-bank, and many-circuit-to-one-bank configurations. Memory (e.g. DRAM) circuits may provide external control inputs for power management. In DDR2 SDRAM, for example, power management may be initiated using the CKE and chip select (CS#) inputs and optionally in combination with a command to place the DDR2 SDRAM in various power down modes.

Four power saving modes for DDR2 SDRAM may be utilized, in accordance with various different embodiments (or even in combination, in other embodiments). In particular, two active power down modes, precharge power down mode, and self-refresh mode may be utilized. If CKE is de-asserted while CS# is asserted, the DDR2 SDRAM may enter an active or precharge power down mode. If CKE is de-asserted while CS# is asserted in combination with the refresh command, the DDR2 SDRAM may enter the self refresh mode.

If power down occurs when there are no rows active in any bank, the DDR2 SDRAM may enter precharge power down mode. If power down occurs when there is a row active in any bank, the DDR2 SDRAM may enter one of the two active power down modes. The two active power down modes may include fast exit active power down mode or slow exit active power down mode.

The selection of fast exit mode or slow exit mode may be determined by the configuration of a mode register. The maximum duration for either the active power down mode or the precharge power down mode may be limited by the refresh requirements of the DDR2 SDRAM and may further be equal to tRFC(MAX).

DDR2 SDRAMs may require CKE to remain stable for a minimum time of tCKE(MIN). DDR2 SDRAMs may also require a minimum time of tXP(MIN) between exiting precharge power down mode or active power down mode and a subsequent non-read command. Furthermore, DDR2 SDRAMs may also require a minimum time of tXARD(MIN) between exiting active power down mode (e.g. fast exit) and a subsequent read command. Similarly, DDR2 SDRAMs may require a minimum time of tXARDS(MIN) between exiting active power down mode (e.g. slow exit) and a subsequent read command.

Just by way of example, power management for a DDR2 SDRAM may require that the SDRAM remain in a power down mode for a minimum of three clock cycles [e.g. tCKE(MIN)=3 clocks]. Thus, the SDRAM may require a power down entry latency of three clock cycles.

Also as an example, a DDR2 SDRAM may also require a minimum of two clock cycles between exiting a power down mode and a subsequent command [e.g. tXP(MIN)=2 clock cycles; tXARD(MIN)=2 clock cycles]. Thus, the SDRAM may require a power down exit latency of two clock cycles.

Of course, for other DRAM or memory technologies, the power down entry latency and power down exit latency may be different, but this does not necessarily affect the operation of power management described here.

Accordingly, in the case of DDR2 SDRAM, a minimum total of five clock cycles may be required to enter and then immediately exit a power down mode (e.g. three cycles to satisfy tCKE(min) due to entry latency plus two cycles to satisfy tXP(MIN) or tXARD(MIN) due to exit latency). These five clock cycles may be hidden from the memory controller if power management is not being performed by the controller itself. Of course, it should be noted that other restrictions on the timing of entry and exit from the various power down modes may exist.

In one exemplary embodiment, the minimum power down entry latency for a DRAM circuit may be n clocks. In addition, in the case of DDR2, n=3, three cycles may be required to satisfy tCKE(MIN). Also, the minimum power down exit latency of a DRAM circuit may be x clocks. In the case of DDR2, x=2, two cycles may be required to satisfy tXP(MIN) and tXARD(MIN). Thus, the power management latency of a DRAM circuit in the present exemplary embodiment may require a minimum of k=n+x clocks for the DRAM circuit to enter power down mode and exit from power down mode. (e.g. DDR2, k=3+2=5 clock cycles).

DRAM Command Operation Period Embodiments

DRAM operations such as precharge or activate may require a certain period of time to complete. During this time, the DRAM, or portion(s) thereof (e.g. bank, etc.) to which the operation is directed may be unable to perform another operation. For example, a precharge operation in a bank of a DRAM circuit may require a certain period of time to complete (specified as tRP for DDR2).

During tRP and after a precharge operation has been initiated, the memory controller may not necessarily be allowed to direct another operation (e.g. activate, etc.) to the same bank of the DRAM circuit. The period of time between the initiation of an operation and the completion of that operation may thus be a command operation period. Thus, the memory controller may not necessarily be allowed to direct another operation to a particular DRAM circuit or portion thereof during a command operation period of various commands or operations. For example, the command operation period of a precharge operation or command may be equal to tRP. As another example, the command operation period of an activate command may be equal to tRCD.

In general, the command operation period need not be limited to a single command. A command operation period can also be defined for a sequence, combination, or pattern of commands. The power management schemes described herein thus need not be limited to a single command and associated command operation period; the schemes may equally be applied to sequences, patterns, and combinations of commands. It should also be noted that a command may have a first command operation period in a DRAM circuit to which the command is directed to, and also have a second command operation period in another DRAM circuit to which the command is not directed to. The first and second command operation periods need not be the same. In addition, a command may have different command operation periods in different mappings of physical DRAM circuits to the banks of a virtual DRAM circuit, and also under different conditions.

It should be noted that the command operation periods may be specified in nanoseconds. For example, tRP may be specified in nanoseconds, and may vary according to the speed grade of a DRAM circuit. Furthermore, tRP may be defined in JEDEC standards (e.g. currently JEDEC Standard No. 21-C for DDR2 SDRAM). Thus, tRP may be measured as an integer number of clock cycles. Optionally, the tRP may not necessarily be specified to be an exact number clock cycles. For DDR2 SDRAMs, the minimum value of tRP may be equivalent to three clock cycles or more.

In additional exemplary embodiments, power management schemes may be based on an interface circuit identifying at least one memory (e.g. DRAM, etc.) circuit that is not currently being accessed by the system. In response to the identification of the at least one memory circuit, a power saving operation may be initiated in association with the at least one memory circuit.

In one embodiment, such power saving operation may involve a power down operation, and in particular, a precharge power down operation, using the CKE pin of the DRAM circuits (e.g. a CKE power management scheme). Other similar power management schemes using other power down control methods and power down modes, with different commands and alternative memory circuit technologies, may also be used.

If the CKE power-management scheme does not involve the memory controller, then the presence of the scheme may be transparent to the memory controller. Accordingly, the power down entry latency and the power down exit latency may be hidden from the memory controller. In one embodiment, the power down entry and exit latencies may be hidden from the memory controller by opportunistically placing at least one first DRAM circuit into a power down mode and, if required, bringing at least one second DRAM circuit out of power down mode during a command operation period when the at least one first DRAM circuit is not being accessed by the system.

The identification of the appropriate command operation period during which at least one first DRAM circuit in a stack may be placed in power down mode or brought out of power down mode may be based on commands directed to the first DRAM circuit (e.g. based on commands directed to itself) or on commands directed to a second DRAM circuit (e.g. based on commands directed to other DRAM circuits).

In another embodiment, the command operation period of the DRAM circuit may be used to hide the power down entry and/or exit latencies. For example, the existing command operation periods of the physical DRAM circuits may be used to the hide the power down entry and/or exit latencies if the delays associated with one or more operations are long enough to hide the power down entry and/or exit latencies. In yet another embodiment, the command operation period of a virtual DRAM circuit may be used to hide the power down entry and/or exit latencies by making the command operation period of the virtual DRAM circuit longer than the command operation period of the physical DRAM circuits.

Thus, the interface circuit may simulate a plurality of physical DRAM circuits to appear as at least one virtual DRAM circuit with at least one command operation period that is different from that of the physical DRAM circuits. This embodiment may be used if the existing command operation periods of the physical DRAM circuits are not long enough to hide the power down entry and/or exit latencies, thus necessitating the interface circuit to increase the command operation periods by simulating a virtual DRAM circuit with at least one different (e.g. longer, etc.) command operation period from that of the physical DRAM circuits.

Specific examples of different power management schemes in various embodiments are described below for illustrative purposes. It should again be strongly noted that the following information is set forth for illustrative purposes and should not be construed as limiting in any manner.

Row Cycle Time Based Power Management Embodiments

Row cycle time based power management is an example of a power management scheme that uses the command operation period of DRAM circuits to hide power down entry and exit latencies. In one embodiment, the interface circuit may place at least one first physical DRAM circuit into power down mode based on the commands directed to a second physical DRAM circuit. Power management schemes such as a row cycle time based scheme may be best suited for a many-circuit-to-one-bank configuration of DRAM circuits.

As explained previously, in a many-circuit-to-one-bank configuration, a plurality of physical DRAM circuits may be mapped to a single bank of a larger capacity virtual DRAM circuit seen by the memory controller. For example, sixteen 256 Mb DDR2 physical SDRAM circuits may appear to the memory controller as a single 4 Gb virtual DDR2 SDRAM circuit. Since a 4 Gb DDR2 SDRAM circuit is specified by the JEDEC standards to have eight physical banks, two of the 256 Mb DDR2 physical SDRAM circuits may be mapped by the interface circuit to a single bank of the virtual 4 Gb DDR2 SDRAM circuit.

In one embodiment, bank 0 of the virtual 4 Gb DDR2 SDRAM circuit may be mapped by the interface circuit to two 256 Mb DDR2 physical SDRAM circuits (e.g. DRAM A and DRAM B). However, since only one page may be open in a bank of a DRAM circuit (either physical or virtual) at any given time, only one of DRAM A or DRAM B may be in the active state at any given time. If the memory controller issues a first activate (e.g. page open, etc.) command to bank 0 of the 4 Gb virtual DRAM, that command may be directed by the interface circuit to either DRAM A or DRAM B, but not to both.

In addition, the memory controller may be unable to issue a second activate command to bank 0 of the 4 Gb virtual DRAM until a period tRC has elapsed from the time the first activate command was issued by the memory controller. In this instance, the command operation period of an activate command may be tRC. The parameter tRC may be much longer than the power down entry and exit latencies.

Therefore, if the first activate command is directed by the interface circuit to DRAM A, then the interface circuit may place DRAM B in the precharge power down mode during the activate command operation period (e.g. for period tRC). As another option, if the first activate command is directed by the interface circuit to DRAM B, then it may place DRAM A in the precharge power down mode during the command operation period of the first activate command. Thus, if p physical DRAM circuits (where p is greater than 1) are mapped to a single bank of a virtual DRAM circuit, then at least p−1 of the p physical DRAM circuits may be subjected to a power saving operation. The power saving operation may, for example, comprise operating in precharge power down mode except when refresh is required. Of course, power savings may also occur in other embodiments without such continuity.

Row Precharge Time Based Power Management Embodiments

Row precharge time based power management is an example of a power management scheme that, in one embodiment, uses the precharge command operation period (that is the command operation period of precharge commands, tRP) of physical DRAM circuits to hide power down entry and exit latencies. In another embodiment, a row precharge time based power management scheme may be implemented that uses the precharge command operation period of virtual DRAM circuits to hide power down entry and exit latencies. In these schemes, the interface circuit may place at least one DRAM circuit into power down mode based on commands directed to the same at least one DRAM circuit. Power management schemes such as the row precharge time based scheme may be best suited for many-circuit-to-one-bank and one-circuit-to-one-bank configurations of physical DRAM circuits. A row precharge time based power management scheme may be particularly efficient when the memory controller implements a closed page policy.

A row precharge time based power management scheme may power down a physical DRAM circuit after a precharge or autoprecharge command closes an open bank. This power management scheme allows each physical DRAM circuit to enter power down mode when not in use. While the specific memory circuit technology used in this example is DDR2 and the command used here is the precharge or autoprecharge command, the scheme may be utilized in any desired context. This power management scheme uses an algorithm to determine if there is any required delay as well as the timing of the power management in terms of the command operation period.

In one embodiment, if the tRP of a physical DRAM circuit [tRP(physical)] is larger than k (where k is the power management latency), then the interface circuit may place that DRAM circuit into precharge power down mode during the command operation period of the precharge or autoprecharge command. In this embodiment, the precharge power down mode may be initiated following the precharge or autoprecharge command to the open bank in that physical DRAM circuit. Additionally, the physical DRAM circuit may be brought out of precharge power down mode before the earliest time a subsequent activate command may arrive at the inputs of the physical DRAM circuit. Thus, the power down entry and power down exit latencies may be hidden from the memory controller.

In another embodiment, a plurality of physical DRAM circuits may appear to the memory controller as at least one larger capacity virtual DRAM circuit with a tRP(virtual) that is larger than that of the physical DRAM circuits [e.g. larger than tRP(physical)]. For example, the physical DRAM circuits may, through simulation, appear to the memory controller as a larger capacity virtual DRAM with tRP(virtual) equal to tRP(physical)+m, where m may be an integer multiple of the clock cycle, or may be a non-integer multiple of the clock cycle, or may be a constant or variable multiple of the clock cycle, or may be less than one clock cycle, or may be zero. Note that m may or may not be equal to j. If tRP(virtual) is larger than k, then the interface circuit may place a physical DRAM circuit into precharge power down mode in a subsequent clock cycle after a precharge or autoprecharge command to the open bank in the physical DRAM circuit has been received by the physical DRAM circuit. Additionally, the physical DRAM circuit may be brought out of precharge power down mode before the earliest time a subsequent activate command may arrive at the inputs of the physical DRAM circuit. Thus, the power down entry and power down exit latency may be hidden from the memory controller.

In yet another embodiment, the interface circuit may make the stack of physical DRAM circuits appear to the memory controller as at least one larger capacity virtual DRAM circuit with tRP(virtual) and tRCD(virtual) that are larger than that of the physical DRAM circuits in the stack [e.g. larger than tRP(physical) and tRCD(physical) respectively, where tRCD(physical) is the tRCD of the physical DRAM circuits]. For example, the stack of physical DRAM circuits may appear to the memory controller as a larger capacity virtual DRAM with tRP(virtual) and tRCD(virtual) equal to [tRP(physical)+m] and tRCD(physical)+1] respectively. Similar to m, 1 may be an integer multiple of the clock cycle, or may be a non-integer multiple of the clock cycle, or may be constant or variable multiple of the clock cycle, or may be less than a clock cycle, or may be zero. Also, 1 may or may not be equal to j and/or m. In this embodiment, if tRP(virtual) is larger than n (where n is the power down entry latency defined earlier), and if 1 is larger than or equal to x (where x is the power down exit latency defined earlier), then the interface circuit may use the following sequence of events to implement a row precharge time based power management scheme and also hide the power down entry and exit latencies from the memory controller.

First, when a precharge or autoprecharge command is issued to an open bank in a physical DRAM circuit, the interface circuit may place that physical DRAM circuit into precharge power down mode in a subsequent clock cycle after the precharge or autoprecharge command has been received by that physical DRAM circuit. The interface circuit may continue to keep the physical DRAM circuit in the precharge power down mode until the interface circuit receives a subsequent activate command to that physical DRAM circuit.

Second, the interface circuit may then bring the physical DRAM circuit out of precharge power down mode by asserting the CKE input of the physical DRAM in a following clock cycle. The interface circuit may also delay the address and control signals associated with the activate command for a minimum of x clock cycles before sending the signals associated with the activate command to the physical DRAM circuit.

The row precharge time based power management scheme described above is suitable for many-circuit-to-one-bank and one-circuit-to-one-bank configurations since there is a guaranteed minimum period of time (e.g. a keep-out period) of at least tRP(physical) after a precharge command to a physical DRAM circuit during which the memory controller will not issue a subsequent activate command to the same physical DRAM circuit. In other words, the command operation period of a precharge command applies to the entire DRAM circuit. In the case of one-circuit-to-many-bank configurations, there is no guarantee that a precharge command to a first portion(s) (e.g. bank) of a physical DRAM circuit will not be immediately followed by an activate command to a second portion(s) (e.g. bank) of the same physical DRAM circuit. In this case, there is no keep-out period to hide the power down entry and exit latencies. In other words, the command operation period of a precharge command applies only to a portion of the physical DRAM circuit.

For example, four 512 Mb physical DDR2 SDRAM circuits through simulation may appear to the memory controller as a single 2 Gb virtual DDR2 SDRAM circuit with eight banks Therefore, the interface circuit may map two banks of the 2 Gb virtual DRAM circuit to each 512 Mb physical DRAM circuit. Thus, banks 0 and 1 of the 2 Gb virtual DRAM circuit may be mapped to a single 512 Mb physical DRAM circuit (e.g. DRAM C). In addition, bank 0 of the virtual DRAM circuit may have an open page while bank 1 of the virtual DRAM circuit may have no open page.

When the memory controller issues a precharge or autoprecharge command to bank 0 of the 2 Gb virtual DRAM circuit, the interface circuit may signal DRAM C to enter the precharge power down mode after the precharge or autoprecharge command has been received by DRAM C. The interface circuit may accomplish this by de-asserting the CKE input of DRAM C during a clock cycle subsequent to the clock cycle in which DRAM C received the precharge or autoprecharge command. However, the memory controller may issue an activate command to the bank 1 of the 2 Gb virtual DRAM circuit on the next clock cycle after it issued the precharge command to bank 0 of the virtual DRAM circuit.

However, DRAM C may have just entered a power down mode and may need to exit power down immediately. As described above, a DDR2 SDRAM may require a minimum of k=5 clock cycles to enter a power down mode and immediately exit the power down mode. In this example, the command operation period of the precharge command to bank 0 of the 2 Gb virtual DRAM circuit may not be sufficiently long enough to hide the power down entry latency of DRAM C even if the command operation period of the activate command to bank 1 of the 2 Gb virtual DRAM circuit is long enough to hide the power down exit latency of DRAM C, which would then cause the simulated 2 Gb virtual DRAM circuit to not be in compliance with the DDR2 protocol. It is therefore difficult, in a simple fashion, to hide the power management latency during the command operation period of precharge commands in a one-circuit-to-many-bank configuration.

Row Activate Time Based Power Management Embodiments

Row activate time based power management is a power management scheme that, in one embodiment, may use the activate command operation period (that is the command operation period of activate commands) of DRAM circuits to hide power down entry latency and power down exit latency.

In a first embodiment, a row activate time based power management scheme may be used for one-circuit-to-many-bank configurations. In this embodiment, the power down entry latency of a physical DRAM circuit may be hidden behind the command operation period of an activate command directed to a different physical DRAM circuit. Additionally, the power down exit latency of a physical DRAM circuit may be hidden behind the command operation period of an activate command directed to itself. The activate command operation periods that are used to hide power down entry and exit latencies may be tRRD and tRCD respectively.

In a second embodiment, a row activate time based power management scheme may be used for many-circuit-to-one-bank and one-circuit-to-one-bank configurations. In this embodiment, the power down entry and exit latencies of a physical DRAM circuit may be hidden behind the command operation period of an activate command directed to itself. In this embodiment, the command operation period of an activate command may be tRCD.

In the first embodiment, a row activate time based power management scheme may place a first DRAM circuit that has no open banks into a power down mode when an activate command is issued to a second DRAM circuit if the first and second DRAM circuits are part of a plurality of physical DRAM circuits that appear as a single virtual DRAM circuit to the memory controller. This power management scheme may allow each DRAM circuit to enter power down mode when not in use. This embodiment may be used in one-circuit-to-many-bank configurations of DRAM circuits. While the specific memory circuit technology used in this example is DDR2 and the command used here is the activate command, the scheme may be utilized in any desired context. The scheme uses an algorithm to determine if there is any required delay as well as the timing of the power management in terms of the command operation period.

In a one-circuit-to-many-bank configuration, a plurality of banks of a virtual DRAM circuit may be mapped to a single physical DRAM circuit. For example, four 512 Mb DDR2 SDRAM circuits through simulation may appear to the memory controller as a single 2 Gb virtual DDR2 SDRAM circuit with eight banks Therefore, the interface circuit may map two banks of the 2 Gb virtual DRAM circuit to each 512 Mb physical DRAM circuit. Thus, banks 0 and 1 of the 2 Gb virtual DRAM circuit may be mapped to a first 512 Mb physical DRAM circuit (e.g. DRAM P). Similarly, banks 2 and 3 of the 2 Gb virtual DRAM circuit may be mapped to a second 512 Mb physical DRAM circuit (e.g. DRAM Q), banks 4 and 5 of the 2 Gb virtual DRAM circuit may be mapped to a third 512 Mb physical DRAM circuit (e.g. DRAM R), and banks 6 and 7 of the 2 Gb virtual DRAM circuit may be mapped to a fourth 512 Mb physical DRAM circuit (e.g. DRAM S).

In addition, bank 0 of the virtual DRAM circuit may have an open page while all the other banks of the virtual DRAM circuit may have no open pages. When the memory controller issues a precharge or autoprecharge command to bank 0 of the 2 Gb virtual DRAM circuit, the interface circuit may not be able to place DRAM P in precharge power down mode after the precharge or autoprecharge command has been received by DRAM P. This may be because the memory controller may issue an activate command to bank 1 of the 2 Gb virtual DRAM circuit in the very next cycle. As described previously, a row precharge time based power management scheme may not be used in a one-circuit-to-many-bank configuration since there is no guaranteed keep-out period after a precharge or autoprecharge command to a physical DRAM circuit.

However, since physical DRAM circuits DRAM P, DRAM Q, DRAM R, and DRAM S all appear to the memory controller as a single 2 Gb virtual DRAM circuit, the memory controller may ensure a minimum period of time, tRRD(MIN), between activate commands to the single 2 Gb virtual DRAM circuit. For DDR2 SDRAMs, the active bank N to active bank M command period tRRD may be variable with a minimum value of tRRD(MIN) (e.g. 2 clock cycles, etc.).

The parameter tRRD may be specified in nanoseconds and may be defined in JEDEC Standard No. 21-C. For example, tRRD may be measured as an integer number of clock cycles. Optionally, tRRD may not be specified to be an exact number of clock cycles. The tRRD parameter may mean an activate command to a second bank B of a DRAM circuit (either physical DRAM circuit or virtual DRAM circuit) may not be able to follow an activate command to a first bank A of the same DRAM circuit in less than tRRD clock cycles.

If tRRD(MIN)=n (where n is the power down entry latency), a first number of physical DRAM circuits that have no open pages may be placed in power down mode when an activate command is issued to another physical DRAM circuit that through simulation is part of the same virtual DRAM circuit. In the above example, after a precharge or autoprecharge command has closed the last open page in DRAM P, the interface circuit may keep DRAM P in precharge standby mode until the memory controller issues an activate command to one of DRAM Q, DRAM R, and DRAM S. When the interface circuit receives the abovementioned activate command, it may then immediately place DRAM P into precharge power down mode if tRRD(MIN)≧n.

Optionally, when one of the interface circuits is a register, the above power management scheme may be used even if tRRD(MIN)<n as long as tRRD(MIN)=n−1. In this optional embodiment, the additional typical one clock cycle delay through a JEDEC register helps to hide the power down entry latency if tRRD(MIN) by itself is not sufficiently long to hide the power down entry latency.

The above embodiments of a row activate time power management scheme require 1 to be larger than or equal to x (where x is the power down exit latency) so that when the memory controller issues an activate command to a bank of the virtual DRAM circuit, and if the corresponding physical DRAM circuit is in precharge power down mode, the interface circuit can hide the power down exit latency of the physical DRAM circuit behind the row activate time tRCD of the virtual DRAM circuit. The power down exit latency may be hidden because the interface circuit may simulate a plurality of physical DRAM circuits as a larger capacity virtual DRAM circuit with tRCD(virtual)=tRCD(physical)+1, where tRCD(physical) is the tRCD of the physical DRAM circuits.

Therefore, when the interface circuit receives an activate command that is directed to a DRAM circuit that is in precharge power down mode, it will delay the activate command by at least x clock cycles while simultaneously bringing the DRAM circuit out of power down mode. Since 1≧x, the command operation period of the activate command may overlap the power down exit latency, thus allowing the interface circuit to hide the power down exit latency behind the row activate time.

Using the same example as above, DRAM P may be placed into precharge power down mode after the memory controller issued a precharge or autoprecharge command to the last open page in DRAM P and then issued an activate command to one of DRAM Q, DRAM R, and DRAM S. At a later time, when the memory controller issues an activate command to DRAM P, the interface circuit may immediately bring DRAM P out of precharge power down mode while delaying the activate command to DRAM P by at least x clock cycles. Since 1≧x, DRAM P may be ready to receive the delayed activate command when the interface circuit sends the activate command to DRAM P.

For many-circuit-to-one-bank and one-circuit-to-one-bank configurations, another embodiment of the row activate time based power management scheme may be used. For both many-circuit-to-one-bank and one-circuit-to-one-bank configurations, an activate command to a physical DRAM circuit may have a keep-out or command operation period of at least tRCD(virtual) clock cycles [tRCD(virtual)=tRCD(physical)+1]. Since each physical DRAM circuit is mapped to one bank (or portion(s) thereof) of a larger capacity virtual DRAM circuit, it may be certain that no command may be issued to a physical DRAM circuit for a minimum of tRCD(virtual) clock cycles after an activate command has been issued to the physical DRAM circuit.

If tRCD(physical) or tRCD(virtual) is larger than k (where k is the power management latency), then the interface circuit may place the physical DRAM circuit into active power down mode on the clock cycle after the activate command has been received by the physical DRAM circuit and bring the physical DRAM circuit out of active power down mode before the earliest time a subsequent read or write command may arrive at the inputs of the physical DRAM circuit. Thus, the power down entry and power down exit latencies may be hidden from the memory controller.

The command and power down mode used for the activate command based power-management scheme may be the activate command and precharge or active power down modes, but other similar power down schemes may use different power down modes, with different commands, and indeed even alternative DRAM circuit technologies may be used.

Refresh Cycle Time Based Power Management Embodiments

Refresh cycle time based power management is a power management scheme that uses the refresh command operation period (that is the command operation period of refresh commands) of virtual DRAM circuits to hide power down entry and exit latencies. In this scheme, the interface circuit places at least one physical DRAM circuit into power down mode based on commands directed to a different physical DRAM circuit. A refresh cycle time based power management scheme that uses the command operation period of virtual DRAM circuits may be used for many-circuit-to-one-bank, one-circuit-to-one-bank, and one-circuit-to-many-bank configurations.

Refresh commands to a DRAM circuit may have a command operation period that is specified by the refresh cycle time, tRFC. The minimum and maximum values of the refresh cycle time, tRFC, may be specified in nanoseconds and may further be defined in the JEDEC standards (e.g. JEDEC Standard No. 21-C for DDR2 SDRAM, etc.). In one embodiment, the minimum value of tRFC [e.g. tRFC(MIN)] may vary as a function of the capacity of the DRAM circuit. Larger capacity DRAM circuits may have larger values of tRFC(MIN) than smaller capacity DRAM circuits. The parameter tRFC may be measured as an integer number of clock cycles, although optionally the tRFC may not be specified to be an exact number clock cycles.

A memory controller may initiate refresh operations by issuing refresh control signals to the DRAM circuits with sufficient frequency to prevent any loss of data in the DRAM circuits. After a refresh command is issued to a DRAM circuit, a minimum time (e.g. denoted by tRFC) may be required to elapse before another command may be issued to that DRAM circuit. In the case where a plurality of physical DRAM circuits through simulation by an interface circuit may appear to the memory controller as at least one larger capacity virtual DRAM circuit, the command operation period of the refresh commands (e.g. the refresh cycle time, tRFC) from the memory controller may be larger than that required by the DRAM circuits. In other words, tRFC(virtual)>tRFC(physical), where tRFC(physical) is the refresh cycle time of the smaller capacity physical DRAM circuits.

When the interface circuit receives a refresh command from the memory controller, it may refresh the smaller capacity physical DRAM circuits within the span of time specified by the tRFC associated with the larger capacity virtual DRAM circuit. Since the tRFC of the virtual DRAM circuit may be larger than that of the associated physical DRAM circuits, it may not be necessary to issue refresh commands to all of the physical DRAM circuits simultaneously. Refresh commands may be issued separately to individual physical DRAM circuits or may be issued to groups of physical DRAM circuits, provided that the tRFC requirement of the physical DRAM circuits is satisfied by the time the tRFC of the virtual DRAM circuit has elapsed.

In one exemplary embodiment, the interface circuit may place a physical DRAM circuit into power down mode for some period of the tRFC of the virtual DRAM circuit when other physical DRAM circuits are being refreshed. For example, four 512 Mb physical DRAM circuits (e.g. DRAM W, DRAM X, DRAM Y, DRAM Z) through simulation by an interface circuit may appear to the memory controller as a 2 Gb virtual DRAM circuit. When the memory controller issues a refresh command to the 2 Gb virtual DRAM circuit, it may not issue another command to the 2 Gb virtual DRAM circuit at least until a period of time, tRFC(MIN)(virtual), has elapsed.

Since the tRFC(MIN)(physical) of the 512 Mb physical DRAM circuits (DRAM W, DRAM X, DRAM Y, and DRAM Z) may be smaller than the tRFC(MIN)(virtual) of the 2 Gb virtual DRAM circuit, the interface circuit may stagger the refresh commands to DRAM W, DRAM X, DRAM Y, DRAM Z such that that total time needed to refresh all the four physical DRAM circuits is less than or equal to the tRFC(MIN)(virtual) of the virtual DRAM circuit. In addition, the interface circuit may place each of the physical DRAM circuits into precharge power down mode either before or after the respective refresh operations.

For example, the interface circuit may place DRAM Y and DRAM Z into power down mode while issuing refresh commands to DRAM W and DRAM X. At some later time, the interface circuit may bring DRAM Y and DRAM Z out of power down mode and issue refresh commands to both of them. At a still later time, when DRAM W and DRAM X have finished their refresh operation, the interface circuit may place both of them in a power down mode. At a still later time, the interface circuit may optionally bring DRAM W and DRAM X out of power down mode such that when DRAM Y and DRAM Z have finished their refresh operations, all four DRAM circuits are in the precharge standby state and ready to receive the next command from the memory controller. In another example, the memory controller may place DRAM W, DRAM X, DRAM Y, and DRAM Z into precharge power down mode after the respective refresh operations if the power down exit latency of the DRAM circuits may be hidden behind the command operation period of the activate command of the virtual 2 Gb DRAM circuit.

FB-DIMM Power Management Embodiments

FIG. 24 shows a memory system 2400 comprising FB-DIMM modules using DRAM circuits with AMB chips, in accordance with another embodiment. As an option, the memory system 2400 may be implemented in the context of the architecture and environment of FIGS. 19-23. Of course, however, the memory system 2400 may be implemented in any desired environment. It should also be noted that the aforementioned definitions may apply during the present description.

As described herein, the memory circuit power management scheme may be associated with an FB-DIMM memory system that uses DDR2 SDRAM circuits. However, other memory circuit technologies such as DDR3 SDRAM, Mobile DDR SDRAM, etc. may provide similar control inputs and modes for power management and the example described in this section can be used with other types of buffering schemes and other memory circuit technologies. Therefore, the description of the specific example should not be construed as limiting in any manner.

In an FB-DIMM memory system 2400, a memory controller 2402 may place commands and write data into frames and send the frames to interface circuits (e.g. AMB chip 2404, etc.). Further, in the FB-DIMM memory system 2400, there may be one AMB chip 2404 on each of a plurality of DIMMs 2406A-C. For the memory controller 2402 to address and control DRAM circuits, it may issue commands that are placed into frames.

The command frames or command and data frames may then be sent by the memory controller 2402 to the nearest AMB chip 2404 through a dedicated outbound path, which may be denoted as a southbound lane. The AMB chip 2404 closest to the memory controller 2402 may then relay the frames to the next AMB chip 2404 via its own southbound lane. In this manner, the frames may be relayed to each AMB chip 2404 in the FB-DIMM memory channel.

In the process of relaying the frames, each AMB chip 2404 may partially decode the frames to determine if a given frame contains commands targeted to the DRAM circuits on that the associated DIMM 2406A-C. If a frame contains a read command addressed to a set of DRAM circuits on a given DIMM 2406A-C, the AMB chip 2404 on the associated DIMM 2406A-C accesses DRAM circuits 2408 to retrieve the requested data. The data may be placed into frames and returned to the memory controller 2402 through a similar frame relay process on the northbound lanes as that described for the southbound lanes.

Two classes of scheduling algorithms may be utilized for AMB chips 2404 to return data frames to the memory controller 2402, including variable-latency scheduling and fixed-latency scheduling. With respect to variable latency scheduling, after a read command is issued to the DRAM circuits 2408, the DRAM circuits 2408 return data to the AMB chip 2404. The AMB chip 2404 then constructs a data frame, and as soon as it can, places the data frame onto the northbound lanes to return the data to the memory controller 2402. The variable latency scheduling algorithm may ensure the shortest latency for any given request in the FB-DIMM channel.

However, in the variable latency scheduling algorithm, DRAM circuits 2408 located on the DIMM (e.g. the DIMM 2406A, etc.) that is closest to the memory controller 2402 may have the shortest access latency, while DRAM circuits 2408 located on the DIMM (e.g. the DIMM 2406C, etc.) that is at the end of the channel may have the longest access latency. As a result, the memory controller 2402 may be sophisticated, such that command frames may be scheduled appropriately to ensure that data return frames do not collide on the northbound lanes.

In a FB-DIMM memory system 2400 with only one or two DIMMs 2406A-C, variable latency scheduling may be easily performed since there may be limited situations where data frames may collide on the northbound lanes. However, variable latency scheduling may be far more difficult if the memory controller 2402 has to be designed to account for situations where the FB-DIMM channel can be configured with one DIMM, eight DIMMs, or any other number of DIMMs. Consequently, the fixed latency scheduling algorithm may be utilized in an FB-DIMM memory system 2400 to simplify memory controller design.

In the fixed latency scheduling algorithm, every DIMM 2406A-C is configured to provide equal access latency from the perspective of the memory controller 2402. In such a case, the access latency of every DIMM 2406A-C may be equalized to the access latency of the slowest-responding DIMM (e.g. the DIMM 2406C, etc.). As a result, the AMB chips 2404 that are not the slowest responding AMB chip 2404 (e.g. the AMB chip 2404 of the DIMM 2406C, etc.) may be configured with additional delay before it can upload the data frames into the northbound lanes.

From the perspective of the AMB chips 2404 that are not the slowest responding AMB chip 2404 in the system, data access occurs as soon as the DRAM command is decoded and sent to the DRAM circuits 2408. However, the AMB chips 2404 may then hold the data for a number of cycles before this data is returned to the memory controller 2402 via the northbound lanes. The data return delay may be different for each AMB chip 2404 in the FB-DIMM channel.

Since the role of the data return delay is to equalize the memory access latency for each DIMM 2406A-C, the data return delay value may depend on the distance of the DIMM 2406A-C from the memory controller 2402 as well as the access latency of the DRAM circuits 2408 (e.g. the respective delay values may be computed for each AMB chip 2404 in a given FB-DIMM channel, and programmed into the appropriate AMB chip 2404.

In the context of the memory circuit power management scheme, the AMB chips 2404 may use the programmed delay values to perform differing classes of memory circuit power management algorithms. In cases where the programmed data delay value is larger than k=n+x, where n is the minimum power down entry latency, x is the minimum power down exit latency, and k is the cumulative sum of the two, the AMB chip 2404 can provide aggressive power management before and after every command. In particular, the large delay value ensures that the AMB chip 2404 can place DRAM circuits 2408 into power down modes and move them to active modes as needed.

In the cases where the programmed data delay value is smaller than k, but larger than x, the AMB chip 2404 can place DRAM circuits 2408 into power down modes selectively after certain commands, as long as these commands provide the required command operation periods to hide the minimum power down entry latency. For example, the AMB chip 2404 can choose to place the DRAM circuits 2408 into a power down mode after a refresh command, and the DRAM circuits 2408 can be kept in the power down mode until a command is issued by the memory controller 2402 to access the specific set of DRAM circuits 2408. Finally, in cases where the programmed data delay is smaller than x, the AMB chip 2404 may choose to implement power management algorithms to a selected subset of DRAM circuits 2408.

There are various optional characteristics and benefits available when using CKE power management in FB-DIMMs. First, there is not necessarily a need for explicit CKE commands, and therefore there is not necessarily a need to use command bandwidth. Second, granularity is provided, such that CKE power management will power down DRAM circuits as needed in each DIMM. Third, the CKE power management can be most aggressive in the DIMM that is closest to the controller (e.g. the DIMM closest to the memory controller which contains the AMB chip that consumes the highest power because of the highest activity rates).

Other Embodiments

While many examples of power management schemes for memory circuits have been described above, other implementations are possible. For DDR2, for example, there may be approximately 15 different commands that could be used with a power management scheme. The above descriptions allow each command to be evaluated for suitability and then appropriate delays and timing may be calculated. For other memory circuit technologies, similar power saving schemes and classes of schemes may be derived from the above descriptions.

The schemes described are not limited to be used by themselves. For example, it is possible to use a trigger that is more complex than a single command in order to initiate power management. In particular, power management schemes may be initiated by the detection of combinations of commands, or patterns of commands, or by the detection of an absence of commands for a certain period of time, or by any other mechanism.

Power management schemes may also use multiple triggers including forming a class of power management schemes using multiple commands or multiple combinations of commands. Power management schemes may also be used in combination. Thus, for example, a row precharge time based power management scheme may be used in combination with a row activate time command based power management scheme.

The description of the power management schemes in the above sections has referred to an interface circuit in order to perform the act of signaling the DRAM circuits and for introducing delay if necessary. An interface circuit may optionally be a part of the stack of DRAM circuits. Of course, however, the interface circuit may also be separate from the stack of DRAM circuits. In addition, the interface circuit may be physically located anywhere in the stack of DRAM circuits, where such interface circuit electrically sits between the electronic system and the stack of DRAM circuits.

In one implementation, for example, the interface circuit may be split into several chips that in combination perform the power management functions described. Thus, for example, there may be a single register chip that electrically sits between the memory controller and a number of stacks of DRAM circuits. The register chip may optionally perform the signaling to the DRAM circuits.

The register chip may further be connected electrically to a number of interface circuits that sit electrically between the register chip and a stack of DRAM circuits. The interface circuits in the stacks of DRAM circuits may then perform the required delay if it is needed. In another implementation there may be no need for an interface circuit in each DRAM stack. In that case, the register chip can perform the signaling to the DRAM circuits directly. In yet another implementation, a plurality of register chips and buffer chips may sit electrically between the stacks of DRAM circuits and the system, where both the register chips and the buffer chips perform the signaling to the DRAM circuits as well as delaying the address, control, and data signals to the DRAM circuits. In another implementation there may be no need for a stack of DRAM circuits. Thus each stack may be a single memory circuit.

Further, the power management schemes described for the DRAM circuits may also be extended to the interface circuits. For example, the interface circuits have information that a signal, bus, or other connection will not be used for a period of time. During this period of time, the interface circuits may perform power management on themselves, on other interface circuits, or cooperatively. Such power management may, for example, use an intelligent signaling mechanism (e.g. encoded signals, sideband signals, etc.) between interface circuits (e.g. register chips, buffer chips, AMB chips, etc.).

It should thus be clear that the power management schemes described here are by way of specific examples for a particular technology, but that the methods and techniques are very general and may be applied to any memory circuit technology to achieve control over power behavior including, for example, the realization of power consumption savings and management of current consumption behavior.

DRAM Circuit Configuration Verification Embodiments

In the various embodiments described above, it may be desirable to verify that the simulated DRAM circuit including any power management scheme or CAS latency simulation or any other simulation behaves according to a desired DRAM standard or other design specification. A behavior of many DRAM circuits is specified by the JEDEC standards and it may be desirable, in some embodiments, to exactly simulate a particular JEDEC standard DRAM. The JEDEC standard may define control signals that a DRAM circuit must accept and the behavior of the DRAM circuit as a result of such control signals. For example, the JEDEC specification for a DDR2 SDRAM may include JESD79-2B (and any associated revisions).

If it is desired, for example, to determine whether a JEDEC standard is met, an algorithm may be used. Such algorithm may check, using a set of software verification tools for formal verification of logic, that protocol behavior of the simulated DRAM circuit is the same as a desired standard or other design specification. This formal verification may be feasible because the DRAM protocol described in a DRAM standard may, in various embodiments, be limited to a few protocol commands (e.g. approximately 15 protocol commands in the case of the JEDEC DDR2 specification, for example).

Examples of the aforementioned software verification tools include MAGELLAN supplied by SYNOPSYS, or other software verification tools, such as INCISIVE supplied by CADENCE, verification tools supplied by JASPER, VERIX supplied by REAL INTENT, 0-IN supplied by MENTOR CORPORATION, etc. These software verification tools may use written assertions that correspond to the rules established by the DRAM protocol and specification.

The written assertions may be further included in code that forms the logic description for the interface circuit. By writing assertions that correspond to the desired behavior of the simulated DRAM circuit, a proof may be constructed that determines whether the desired design requirements are met. In this way, one may test various embodiments for compliance with a standard, multiple standards, or other design specification.

For example, assertions may be written that there are no conflicts on the address bus, command bus or between any clock, control, enable, reset or other signals necessary to operate or associated with the interface circuits and/or DRAM circuits. Although one may know which of the various interface circuit and DRAM stack configurations and address mappings that have been described herein are suitable, the aforementioned algorithm may allow a designer to prove that the simulated DRAM circuit exactly meets the required standard or other design specification. If, for example, an address mapping that uses a common bus for data and a common bus for address results in a control and clock bus that does not meet a required specification, alternative designs for the interface circuit with other bus arrangements or alternative designs for the interconnect between the components of the interface circuit may be used and tested for compliance with the desired standard or other design specification.

While the foregoing is directed to embodiments of the present invention, other and further embodiments of the invention may be devised without departing from the basic scope thereof. 

What is claimed is:
 1. A sub-system, comprising: an interface circuit adapted for coupling with a plurality of physical memory circuits and a system, the interface circuit configured to: interface the plurality of physical memory circuits and the system for emulating a virtual memory circuit having a refresh cycle time constraint that is different from a refresh cycle time constraint of the plurality of physical memory circuits; receive a refresh command from the system directed to the virtual memory circuit; in response to receiving the refresh command from the system directed to the virtual memory circuit, determine a staggered sequence of timings for issuing refresh commands to the plurality of physical memory circuits, wherein the staggered sequence of timings satisfy the refresh cycle time constraint of the virtual memory circuit and the refresh cycle time constraint of the plurality of physical memory circuits; and based on the staggered sequence of timings, issue refresh commands to respective physical memory circuits, wherein the refresh cycle time constraint of the virtual memory circuit specifies a minimum refresh cycle time (tRFC) associated with the virtual memory circuit, and wherein the first refresh command and the second refresh command are issued within a span of time specified by the tRFC associated with the virtual memory circuit.
 2. The sub-system of claim 1, wherein interfacing the plurality of physical memory circuits and the system for emulating the virtual memory circuit comprises: using the plurality of physical memory circuits to emulate the virtual memory circuit; and presenting the virtual memory circuit to the system, wherein the virtual memory circuit appears to the system as having the refresh cycle time constraint of the virtual memory circuit.
 3. The sub-system of claim 1, wherein issuing refresh commands to respective physical memory circuits comprises: based on the staggered sequence of timings, issuing a first refresh command to a first physical memory circuit of the plurality of physical memory circuits; and based on the staggered sequence of timings, issuing a second refresh command to a second physical memory circuit of the plurality of physical memory circuits, wherein the first refresh command and the second refresh command are issued at different times.
 4. The sub-system of claim 3, wherein issuing the first refresh command to the first physical memory circuit of the plurality of physical memory circuits comprises issuing the first refresh command to a group of physical memory circuits that include the first physical memory circuit.
 5. The sub-system of claim 3, wherein issuing the first refresh command to the first physical memory circuit of the plurality of physical memory circuits comprises issuing a command to the second physical memory circuit to bring the second physical memory circuit into a power down mode.
 6. The sub-system of claim 3, wherein the interface circuit is configured to: after issuing the first refresh command and the second refresh command, issuing commands to the first physical memory circuit and the second physical memory circuit to bring the first physical memory circuit and the second physical memory circuit into a precharge power down mode.
 7. The sub-system of claim 1, wherein a capacity of the virtual memory circuit is larger than a capacity of each of the plurality of physical memory circuits, and wherein the refresh cycle time constraint of the virtual memory circuit is larger than the refresh cycle time constraint of the plurality of physical memory circuits.
 8. An apparatus, comprising: a plurality of physical memory circuits; an interface circuit electrically coupled to the plurality of physical memory circuits and a system, the interface circuit configured to: interface the plurality of physical memory circuits and the system for emulating a virtual memory circuit having a refresh cycle time constraint that is different from a refresh cycle time constraint of the plurality of physical memory circuits; receive a refresh command from the system directed to the virtual memory circuit; in response to receiving the refresh command from the system directed to the virtual memory circuit, determine a staggered sequence of timings for issuing refresh commands to the plurality of physical memory circuits, wherein the staggered sequence of timings satisfy the refresh cycle time constraint of the virtual memory circuit and the refresh cycle time constraint of the plurality of physical memory circuits; and based on the staggered sequence of timings, issue refresh commands to respective physical memory circuits, wherein the refresh cycle time constraint of the virtual memory circuit specifies a minimum refresh cycle time (tRFC) associated with the virtual memory circuit, and wherein the first refresh command and the second refresh command are issued within a span of time specified by the tRFC associated with the virtual memory circuit.
 9. The apparatus of claim 8, wherein interfacing the plurality of physical memory circuits and the system for emulating the virtual memory circuit comprises: using the plurality of physical memory circuits to emulate the virtual memory circuit; and presenting the virtual memory circuit to the system, wherein the virtual memory circuit appears to the system as having the refresh cycle time constraint of the virtual memory circuit.
 10. The apparatus of claim 8, wherein issuing refresh commands to respective physical memory circuits comprises: based on the staggered sequence of timings, issuing a first refresh command to a first physical memory circuit of the plurality of physical memory circuits; and based on the staggered sequence of timings, issuing a second refresh command to a second physical memory circuit of the plurality of physical memory circuits, wherein the first refresh command and the second refresh command are issued at different times.
 11. The apparatus of claim 10, wherein issuing the first refresh command to the first physical memory circuit of the plurality of physical memory circuits comprises issuing the first refresh command to a group of physical memory circuits that include the first physical memory circuit.
 12. The apparatus of claim 10, wherein issuing the first refresh command to the first physical memory circuit of the plurality of physical memory circuits comprises issuing a command to the second physical memory circuit to bring the second physical memory circuit into a power down mode.
 13. The apparatus of claim 10, wherein the interface circuit is configured to: after issuing the first refresh command and the second refresh command, issuing commands to the first physical memory circuit and the second physical memory circuit to bring the first physical memory circuit and the second physical memory circuit into a precharge power down mode.
 14. The apparatus of claim 8, wherein a capacity of the virtual memory circuit is larger than a capacity of each of the plurality of physical memory circuits, and wherein the refresh cycle time constraint of the virtual memory circuit is larger than the refresh cycle time constraint of the plurality of physical memory circuits.
 15. A method, comprising: interfacing, by an interface circuit, a plurality of physical memory circuits and a system to emulate a virtual memory circuit having a refresh cycle time constraint that is different from a refresh cycle time constraint of the plurality of physical memory circuits; receiving a refresh command from the system directed to the virtual memory circuit; in response to receiving the refresh command from the system directed to the virtual memory circuit, determining a staggered sequence of timings for issuing refresh commands to the plurality of physical memory circuits, wherein the staggered sequence of timings satisfy the refresh cycle time constraint of the virtual memory circuit and the refresh cycle time constraint of the plurality of physical memory circuits; and based on the staggered sequence of timings, issuing refresh commands to respective physical memory circuits, wherein the refresh cycle time constraint of the virtual memory circuit specifies a minimum refresh cycle time (tRFC) associated with the virtual memory circuit, and wherein the first refresh command and the second refresh command are issued within a span of time specified by the tRFC associated with the virtual memory circuit.
 16. The method of claim 15, wherein interfacing the plurality of physical memory circuits and the system to emulate the virtual memory circuit comprises: using the plurality of physical memory circuits to emulate the virtual memory circuit; and presenting the virtual memory circuit to the system, wherein the virtual memory circuit appears to the system as having the refresh cycle time constraint of the virtual memory circuit.
 17. The method of claim 15, wherein issuing refresh commands to respective physical memory circuits comprises: based on the staggered sequence of timings, issuing a first refresh command to a first physical memory circuit of the plurality of physical memory circuits; and based on the staggered sequence of timings, issuing a second refresh command to a second physical memory circuit of the plurality of physical memory circuits, wherein the first refresh command and the second refresh command are issued at different times.
 18. The method of claim 17, wherein the interface circuit is configured to: after issuing the first refresh command and the second refresh command, issuing commands to the first physical memory circuit and the second physical memory circuit to bring the first physical memory circuit and the second physical memory circuit into a precharge power down mode.
 19. The method of claim 15, wherein a capacity of the virtual memory circuit is larger than a capacity of each of the plurality of physical memory circuits, and wherein the refresh cycle time constraint of the virtual memory circuit is larger than the refresh cycle time constraint of the plurality of physical memory circuits.
 20. A sub-system, comprising: an interface circuit adapted for coupling with a plurality of physical memory circuits and a system, the interface circuit configured to: interface the plurality of physical memory circuits and the system for emulating a virtual memory circuit having a refresh cycle time constraint that is different from a refresh cycle time constraint of the plurality of physical memory circuits; receive a refresh command from the system directed to the virtual memory circuit; in response to receiving the refresh command from the system directed to the virtual memory circuit, determine a staggered sequence of timings for issuing refresh commands to the plurality of physical memory circuits, wherein the staggered sequence of timings satisfy the refresh cycle time constraint of the virtual memory circuit and the refresh cycle time constraint of the plurality of physical memory circuits; and based on the staggered sequence of timings, issue refresh commands to respective physical memory circuits, the issuing comprising: based on the staggered sequence of timings, issue a first refresh command to a first physical memory circuit of the plurality of physical memory circuits and issue a command to a second physical memory circuit of the plurality of physical memory circuits to bring the second physical memory circuit into a power down mode; and based on the staggered sequence of timings, issue a second refresh command to the second physical memory circuit, wherein the first refresh command and the second refresh command are issued at different times. 